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1 SEQUENCE LISTING 

2 



3 (1) General Information: < ^^^^ 




5 (i) APPLICANT: Garner, Ian 

6 Dalrymple, Michael A T 

7 Prunkard, Donna E 

8 Foster, Donald C 
9 

10 (ii) TITLE OF INVENTION: Production of Fibrinogen in Transgenic 

11 Animals 
12 

13 (iii) NUMBER OF SEQUENCES: 27 
14 

15 (iv) CORRESPONDENCE ADDRESS: 

16 (A) ADDRESSEE: ZymoGenetics , Inc. 

17 (B) STREET: 4225 Roosevelt Way, N.E. 

18 (C) CITY: Seattle 

19 (D) STATE: WA 

20 (E) COUNTRY: USA 

21 (F) ZIP: 98105 
22 

23 (v) COMPUTER READABLE FORM: 

24 (A) MEDIUM TYPE: Floppy disk 

25 (B) COMPUTER: IBM PC compatible 

26 (C) OPERATING SYSTEM: PC -DOS /MS -DOS 

27 (D) SOFTWARE: Patentln Release #1.0, Version #1.25 
28 

2 9 (vi) CURRENT APPLICATION DATA: 

30 (A) APPLICATION NUMBER: 

31 (B) FILING DATE : 

32 (C) CLASSIFICATION: 
33 

34 (viii) ATTORNEY/ AGENT INFORMATION: 

35 (A) NAME: Parker, Gary E 

36 (B) REGISTRATION NUMBER: 31-648 

3 7 (C) REFERENCE/DOCKET NUMBER: 93-15 
38 

3 9 (ix) TELECOMMUNICATION INFORMATION: 

40 (A) TELEPHONE: 206-547-8080 ext 322 

41 (B) TELEFAX: 206-548-2329 
42 

43 
44 

45 (2) INFORMATION FOR SEQ ID NO : 1 : 
46 

47 (i) SEQUENCE CHARACTERISTICS: 

48 (A) LENGTH: 5943 base pairs 

49 (B) TYPE: nucleic acid 

50 (C) STRANDEDNESS: double 

51 (D) TOPOLOGY: linear 



* 
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52 

53 (ii) MOLECULE TYPE: DNA (genomic) 

54 

55 

56 (vii) IMMEDIATE SOURCE: 

57 (B) CLONE: Human Fibrinogen A-alpha chain 
58 

59 (ix) FEATURE: 

60 (A) NAME/KEY: CDS 

61 (B) LOCATION: j oin (31 . . 84 , 1154.. 1279, 1739.. 1922, 3055.. 3200, 

62 3786.. 5210) 
63 

64 

65 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

66 

67 GTCTAGGAGC CAGCCCCACC CTTAGAAAAG ATG TTT TCC ATG AGG ATC GTC TGC 54 

68 Met Phe Ser Met Arg lie Val Cys 

69 15 
70 

71 CTA GTT CTA AGT GTG GTG GGC ACA GCA TGG GTATGGCCCT TTTCATTTTT 104 

72 Leu Val Leu Ser Val Val Gly Thr Ala Trp 

73 10 15 
74 

75 TCTTCTTGCT TTCTCTCTGG TGTTTATTCC ACAAAGAGCC TGGAGGTCAG AGTCTACCTG 164 
76 

77 CTCTATGTCC TGACACACTC TTAGCTTTAT GACCCCAGGC CTGGGAGGAA ATTTCCTGGG 224 
78 

79 TGGGCTTGAC ACCTCAAGAA TACAGGGTAA TATGACACCA AGAGGAAGAT CTTAGATGGA 284 
80 

81 TGAGAGTGTA CAACTACAAG GGAAACTTTA GCATCTGTCA TTCAGTCTTA CCACATTTTG 344 
82 

83 TTTTGTTTTG TTTTAAAAAG GGCAAGAATT ATTTGCCATC CTTGTACCTA TAAAGCCTTG 404 
84 

85 GTGCATTATA ATGCTAGTTA ATGGAATAAA ACATTTTATG GTAAGATTTG TTTTCTTTAG 464 
86 

87 TTATTAATTT CTTGCTACTT GTCCATAATA AGCAGAACTT TTAGTGTTAG TACAGTTTTG 524 
88 

89 CTGAAAGGTT ATTGTTGTGT TTGTCAAGAC AGAAGAAAAA GCAAACGAAT TATCTTTGGA 584 
90 

91 AATATCTTTG CAGTATCAGA AGAGATTAGT TAGTAAGGCA ATACGCTTTT CCGCAGTAAT 644 
92 

93 GGTATTCTTT TAAATTATGA ATCCATCTCT AAAGGTTACA TAGAAACTTG AAGGAGAGAG 704 
94 

95 GAACATTCAG TTAAGATAGT CTAGGTTTTT CTACTGAAGC AGCAATTACA GGAGAAAGAG 764 
96 

97 CTCTACAGTA GTTTTCAACT TTCTGTCTGC AGTCATTAGT AAAAATGAAA AGGTAAAATT 824 
98 

99 TAACTGATTT TATAGATTCA AATAATTTTC CTTTTAGGAT GGATTCTTTA AAACTCCTAA 884 
100 

101 TATTTATCAA ATGCTTATTT AAGTGTCACA CACAGTTAAG AAATTTGTAC ACCTTGTCTC 944 
102 
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103 CTTTAATTCT CATAACAACT CCATAAAATG GGTCCTAGGA TTTCCATTTG AAGATAAGAA 1004 
104 

105 ACCTGAAGCT TGCCGAAGCC CTGTGTCTGC TCTCCTTAAT CTCTGTGAGA GTGCCATCTC 1064 
106 

107 TTCCTGGGGA CTTGTAGGCA TGCCACTGTC TCCTCTTCTG GCTAACATTG CTGTTGCTCT 1124 
108 

109 CTTTTGTGTA TGTGAATGAA TCTTTAAAG ACT GCA GAT AGT GGT GAA GGT GAC 1177 



110 Thr Ala Asp Ser Gly Glu Gly Asp 

111 20 25 
112 

113 TTT CTA GCT GAA GGA GGA GGC GTG CGT GGC CCA AGG GTT GTG GAA AGA 1225 

114 Phe Leu Ala Glu Gly Gly Gly Val Arg Gly Pro Arg Val Val Glu Arg 

115 30 35 40 
116 

117 CAT CAA TCT GCC TGC AAA GAT TCA GAC TGG CCC TTC TGC TCT GAT GAA 1273 

118 His Gin Ser Ala Cys Lys Asp Ser Asp Trp Pro Phe Cys Ser Asp Glu 

119 45 50 55 
120 

121 GAC TGG GTAAGCAGTC AGCGGGGGAA GCAGGAGATT CCTTCCCTCT GATGCTAGAG 1329 

122 Asp Trp 

123 60 



124 

125 GGGCTCACAG GCTGACCTGA TTGGTCCCAG AAACTTTTTT AAATAGAAAA TAATTGAATA 1389 
126 

12 7 GTTACCTACA TAGCAAATAA AGAAAAGGAA CCTACTCCCA AGAGCACTGT TTATTTACCT 1449 
128 

12 9 CCCCAACTCT GGATCATTAG TGGGTGAACA GACAGGATTT CAGTTGCATG CTCAGGCAAA 1509 
130 

131 ACCAGGCTCC TGAGTATTGT GGCCTCAATT TCCTGGCACC TATTTATGGC TAAGTGGACC 1569 
132 

133 CTCATTCCAG AGTTTCTCTG CGACCTCTAA CTAGTCCTCT TACCTACTTT TAAGCCAACT 1629 
134 

13 5 TATCTGGAAG AGAAAGGGTA GGAAGAAATG GGGGCTGCAT GGAAACATGC AAAATTATTC 1689 
136 

13 7 TGAATCTGAG AGATAGATCC TTACTGTAAT TTTCTCCCTT CACTTTCAG AAC TAC 1744 

138 Asn Tyr 

139 

140 

141 AAA TGC CCT TCT GGC TGC AGG ATG AAA GGG TTG ATT GAT GAA GTC AAT 1792 

142 Lys Cys Pro Ser Gly Cys Arg Met Lys Gly Leu lie Asp Glu Val Asn 

143 65 70 75 
144 

145 CAA GAT TTT ACA AAC AGA ATA AAT AAG CTC AAA AAT TCA CTA TTT GAA 1840 

146 Gin Asp Phe Thr Asn Arg lie Asn Lys Leu Lys Asn Ser Leu Phe Glu 

147 80 85 90 
148 

149 TAT CAG AAG AAC AAT AAG GAT TCT CAT TCG TTG ACC ACT AAT ATA ATG 1888 

150 Tyr Gin Lys Asn Asn Lys Asp Ser His Ser Leu Thr Thr Asn lie Met 

151 95 100 105 110 
152 

153 GAA ATT TTG AGA GGC GAT TTT TCC TCA GCC AAT A GTAAGTATTA 1932 
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154 Glu lie Leu Arg Gly Asp Phe Ser Ser Ala Asn 

155 115 120 
156 

157 CATATTTACT TCTTTGACTT TATAACAGAA ACAACAAAAA TCCTAAATAA ATATGATATC 1992 
158 

159 CGCTTATATC TATGACAATT TCATCCCAAA GTACTTAGTG TAGAAACACA TACCTTCATA 2 052 
160 

161 ATATCCCTGA AAATTTTAAG AGGGAGCTTT TGTTTTCGTT ATTTTTTCAA AGTAAAAGAT 2112 
162 

163 GTTAACTGAG ATTGTTTAAG GTCACAAAAT AAGTCAGAAT TTTGGATTAA AACAAGAATT 2172 
164 

165 TAAATGTGTT CTTTTCAACA GTATATACTG AAAGTAGGAT GGGTCAGACT CTTTGAGTTG 2232 
166 

167 ATATTTTTGT TTCTGCTTTG TAAAGGTGAA AACTGAGAGG TCAAGGAACT TGTTCAAAGA 2292 
168 

169 CACAGAGCTG GGAATTCAAC TCCCAGACTC CACTGAGCTG ATTAGGTAGA TTTTTAAATT 2352 
170 

171 TAAAATATAG GGTCAAGCTA CGTCATTCTC ACAGTCTACT CATTAGGGTT AGGAAACATT 2412 
172 

173 GCATTCACTC TGGGCATGGA CAGCGAGTCT AGGGAGTCCT CAGTTTCTCA AGTTTTGCTT 2472 
174 

175 TGCCTTTTTA CACCTTCACA AACACTTGAC ATTTAAAATC AGTGATGCCA ACACTAGCTG 2532 
176 

177 GCAAGTGAGT GATCCTGTTG ACCCAAAACA GCTTAGGAAC CATTTCAAAT CTATAGAGTT 2592 
178 

179 AAAAAGAAAA GCTCATCAGT AAGAAAATCC AATATGTTCA AGTCCCTTGA TTAAGGATGT 2652 
180 

181 TATAAAATAA TTGAAATGCA ATCAAACCAA CTATTTTAAC TCCAAATTAC ACCTTTAAAA 2712 
182 

183 TTCCAAAGAA AGTTCTTCTT CTATATTTCT TTGGGATTAC TAATTGCTAT TAGGACATCT 2772 
184 

185 TAACTGGCAT TCATGGAAGG CTGCAGGGCA TAACATTATC CAAAAGTCAA ATGCCCCATA 2832 
186 

187 GGTTTTGAAC TCACAGATTA AACTGTAACC AAAATAAAAT TAGGCATATT TACAAGCTAG 2892 
188 

189 TTTCTTTCTT TCTTTTTTCT CTTTCTTTCT TTCTTTCTTT CTTTCTTTCT TTCTTTCTTT 2952 
190 

191 CTTTCTTTCT TTCTCCTTCC TTCCTTTCTT CCTTTCTTTT TTGCTGGCAA TTACAGACAA 3012 
192 

193 ATCACTCAGC AGCTACTTCA ATAACCATAT TTTCGATTTC AG AC CGT GAT AAT 3065 

194 Asn Arg Asp Asn 

195 125 
196 

197 ACC TAC AAC CGA GTG TCA GAG GAT CTG AGA AGC AGA ATT GAA GTC CTG 3113 

198 Thr Tyr Asn Arg Val Ser Glu Asp Leu Arg Ser Arg lie Glu Val Leu 

199 130 135 140 
200 

2 01 AAG CGC AAA GTC ATA GAA AAA GTA CAG CAT ATC CAG CTT CTG CAG AAA 3161 

202 Lys Arg Lys Val lie Glu Lys Val Gin His lie Gin Leu Leu Gin Lys 

203 145 150 155 
204 
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205 AAT GTT AGA GCT CAG TTG GTT GAT ATG AAA CGA CTG GAG GTAAGTATGT 3210 

206 Asn Val Arg Ala Gin Leu Val Asp Met Lys Arg Leu Glu 

207 160 165 170 
208 

209 GGCTGTGGTC CCGAGTGTCC TTGTTTTTGA GTAGAGGGAA AAGGAAGGCG ATAGTTATGC 3270 
210 

211 ACTGAGTGTC TACTATATGC AGAGAAAAGT GTTATATCCA TCATCTACCT AAAAGTAGGT 3330 
212 

213 ATTATTTTCC TCACTCCACA GTTGAAGAAA AAAAAATTCA GAGATATTAA GTAAATTTTC 33 90 
214 

215 CAACGTACAT AGATAGTAAT TCAAAGCAAT GTTCAGTCCC TGTCTATTCC AAGCCATTAC 3450 
216 

217 ATCACCACAC CTCTGAGCCC TCAGCCTGAG TTCACCAAGG ATCATTTAAT TAGCGTTTCC 3510 
218 

219 TTTGAGAGGG AATAGCACCT TACTCTTGAT CCATTCTGAG GCTAAGATGA ATTAAACAGC 3 570 
220 

221 ATCCATTGCT TATCCTGGCT AGCCCTGCAA TACCCAACAT CTCTTCCACT GAGGGTGCTC 3630 
222 

223 GATAGGCAGA AAACAGAGAA TATTAAGTGG TAGGTCTCCG AGTCAAAAAA AATGAAACCA 3690 
224 

225 GTTTCCAGAA GGAAAATTAA CTACCAGGAA CTCAATAGAC GTAGTTTATG TATTTGTATC 3 750 
226 

227 TACATTTTCT CTTTATTTTT CTCCCCTCTC TCTAG GTG GAC ATT GAT ATT AAG 3803 

228 Val Asp lie Asp lie Lys 

229 175 
230 

231 ATC CGA TCT TGT CGA GGG TCA TGC AGT AGG GCT TTA GCT CGT GAA GTA 3851 

232 lie Arg Ser Cys Arg Gly Ser Cys Ser Arg Ala Leu Ala Arg Glu Val 

233 180 185 190 
234 

235 GAT CTG AAG GAC TAT GAA GAT CAG CAG AAG CAA CTT GAA CAG GTC ATT 3899 

236 Asp Leu Lys Asp Tyr Glu Asp Gin Gin Lys Gin Leu Glu Gin Val lie 

237 195 200 205 
238 

23 9 GCC AAA GAC TTA CTT CCC TCT AGA GAT AGG CAA CAC TTA CCA CTG ATA 3947 

240 Ala Lys Asp Leu Leu Pro Ser Arg Asp Arg Gin His Leu Pro Leu lie 

241 210 215 220 
242 

243 AAA ATG AAA CCA GTT CCA GAC TTG GTT CCC GGA AAT TTT AAG AGC CAG 3 995 

244 Lys Met Lys Pro Val Pro Asp Leu Val Pro Gly Asn Phe Lys Ser Gin 

245 225 230 235 240 
246 

247 CTT CAG AAG GTA CCC CCA GAG TGG AAG GCA TTA ACA GAC ATG CCG CAG 4043 

248 Leu Gin Lys Val Pro Pro Glu Trp Lys Ala Leu Thr Asp Met Pro Gin 

249 245 250 255 
250 

251 ATG AGA ATG GAG TTA GAG AGA CCT GGT GGA AAT GAG ATT ACT CGA GGA 4091 

252 Met Arg Met Glu Leu Glu Arg Pro Gly Gly Asn Glu lie Thr Arg Gly 

253 260 265 270 
254 

255 GGC TCC ACC TCT TAT GGA ACC GGA TCA GAG ACG GAA AGC CCC AGG AAC 4139 
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Jzj 


bAb 


1 CI 


Abb 


bbL 


1LA 


A A 
bAA 
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7a TP 
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TTP 
1 X L 


71P7A 
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71 TIT 
AAX 
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ALA 


71 AP. 
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P.A A 
bAA 


TPP 
X LL 


A Q C C 
*± _7 D D 




d~\ n 
ulU 


Ser 


Arg 


P.1 w 
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CoT* 
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ulU 


COT* 
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Tl 0 
X xc 
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X 11X 


A QTl 
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560 




T) /- 

i^D 




































O O *7 


Abl 


rp/-irp 
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PAT 

LAI 


LAL 
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LL 1 
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ATA 
Al A 
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PA A 
bAA 


TTP 
1 1 L 


PPT 
LL 1 


TPP 
1 LL 


CCW 
LbX 


bb X 


AAA 
AAA 


TPT 
X L X 


enft*! 

JUU J 


TOO 


Ser 


Ser 


nlo 


nlo 


Pro 


ri~\ 


lie 


Til a 
Ala 


C~\ 11 
bX LL 


xrxxc 


It X LV 


Cot* 
OCX 


Arrr 

Axrg 


uiy 


xjy 0 


Opr 

OCX. 
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DO D 










CTft 
D / L/ 










j / — / 






-"J ""J O 




































in 
jjI 


1 LA 


apt 
Aval 


T* 7\ P 
1 AL 


AbL 


A A A 
AAA 


LAA 


TTT 


TV PT 
AL 1 


7APT 
Ab X 


71PP 
AbL 


71 PP 
ALb 


71 

Abx 


TAP 
X AL 


A AP 
AA^. 


AP.A 
AbA 


bbA 


jU Jl 


JjZ 


Cot* 


Co T* 
OCX 


Twt* 

iyr 


Cot** 
OCX 


Xiy 0 


m n 

Olil 


IrlXC 


Thr 
X 11X 


Cot* 
OCX 


Cot* 

OCX 


± XXX 


Cot* 

OCX 


Tvr 

xyx7 


Asn 


Arci 
Axy 


Gly 












O O VJ 










CQC 
D O D 










Z/ \J 








334 








































ILL 


ap a 

ALA 


TTT 


GAA 


AGC 


AAG 


AGC 


TAT 


AAA 


ATG 


GCA 


GAT 


GAG 


GCC 


aa& 

uvjn 


5099 


*a i c 
job 


Asp 


Ser 


TVi v* 

inr 


rile 


Glu 


Ser 


Lys 


Ser 


Tyr 


Lys 


Met 


Ala 


Asp 


Glu 


Ala 


P,l V 

bx y 




^ 1 *7 
















600 










605 










"5 o o 




































339 


AGT 


GAA 


GCC 


GAT 


CAT 


GAA 


GGA 


ACA 


CAT 


AGC 


ACC 


AAG 


AGA 


GGC 


CAT 


GCT 


5147 


340 


Ser 


Glu 


Ala 


Asp 


His 


Glu 


Gly 


Thr 


His 


Ser 


Thr 


Lys 


Arg 


Gly 


His 


Ala 




341 




610 










615 










620 












342 




































343 


AAA 


TCT 


CGC 


CCT 


GTC 


AGA 


GGT 


ATC 


CAC 


ACT 


TCT 


CCT 


TTG 


GGG 


AAG 


CCT 


5195 


344 


Lys 


Ser 


Arg 


Pro 


Val 


Arg 


Gly 


He 


His 


Thr 


Ser 


Pro 


Leu 


Gly 


Lys 


Pro 




345 


625 










630 










635 










640 




346 




































347 


TCC 


CTG 


TCC 


CCC 


TAGACTAAGT TAAATATTTC TGCACAGTGT TCCCATGGCC 




5247 


348 


Ser 


Leu 


Ser 


Pro 




























349 










645 


























350 




































351 


CCTTGCATTT CCTTCTTAAC TCTCTGTTAC ACGTCATTGA 


AACTACACTT TTTTGGTCTG 


5307 


352 
353 


TTTTTGTGCT AGACTGTAAG TTCCTTGGGG GCAGGGCCTT 


TGTCTGTCTC ATCTCTGTAT 


5367 


354 
355 


TCCCAAATGC CTAACAGTAC AGAGCCATGA CTCAATAAAT 


ACATGTTAAA TGGATGAATG 


5427 


356 
357 


AATTCCTCTG AAACTCTATT TGAGCTTATT TAGTCAAATT 


CTTTCACTAT TCAAAGTGTG 


5487 



PAGE: 8 RAW SEQUENCE LISTING DATE: 03/23/94 

PATENT APPLICATION US/08/206,176 TIME: 12:13:56 

INPUT SET; S1765.raw 

358 

359 TGCTATTAGA ATTGTCACCC AACTGATTAA TCACATTTTT AGTATGTGTC TCAGTTGACA 5547 
360 

361 TTTAGGTCAG GCTAAATACA AGTTGTGTTA GTATTAAGTG AGCTTAGCTA CCTGTACTGG 5607 
362 

363 TTACTTGCTA TTAGTTTGTG CAAGTAAAAT TCCAAATACA TTTGAGGAAA ATCCCCTTTG 5667 
364 

365 CAATTTGTAG GTATAAATAA CCGCTTATTT GCATAAGTTC TATCCCACTG TAAGTGCATC 5727 
366 

367 CTTTCCCTAT GGAGGGAAGG AAAGGAGGAA GAAAGAAAGG AAGGGAAAGA AACAGTATTT 5787 
368 

369 GCCTTATTTA ATCTGAGCCG TGCCTATCTT TGTAAAGTTA AATGAGAATA ACTTCTTCCA 5847 
370 

371 ACCAGCTTAA TTTTTTTTTT AGACTGTGAT GATGTCCTCC AAACACATCC TTCAGGTACC 5907 
372 

373 CAAAGTGGCA TTTTCAATAT CAAGCTATCC GGATCC 5943 

374 

375 

376 (2) INFORMATION FOR SEQ ID NO:2: 
377 

378 (i) SEQUENCE CHARACTERISTICS: 

379 (A) LENGTH: 644 amino acids 

380 (B) TYPE: amino acid 

381 (D) TOPOLOGY: linear 
382 

383 (ii) MOLECULE TYPE: protein 

384 

385 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

386 

387 Met Phe Ser Met Arg He Val Cys Leu Val Leu Ser Val Val Gly Thr 

388 1 5 10 15 
389 

390 Ala Trp Thr Ala Asp Ser Gly Glu Gly Asp Phe Leu Ala Glu Gly Gly 

391 20 25 30 
392 

393 Gly Val Arg Gly Pro Arg Val Val Glu Arg His Gin Ser Ala Cys Lys 

394 35 40 45 
395 

396 Asp Ser Asp Trp Pro Phe Cys Ser Asp Glu Asp Trp Asn Tyr Lys Cys 

397 50 55 60 
398 

399 Pro Ser Gly Cys Arg Met Lys Gly Leu He Asp Glu Val Asn Gin Asp 

400 65 70 75 80 
401 

402 Phe Thr Asn Arg He Asn Lys Leu Lys Asn Ser Leu Phe Glu Tyr Gin 

403 85 90 95 
404 

405 Lys Asn Asn Lys Asp Ser His Ser Leu Thr Thr Asn He Met Glu He 

406 100 105 110 
407 

408 Leu Arg Gly Asp Phe Ser Ser Ala Asn Asn Arg Asp Asn Thr Tyr Asn 



PAGE: 9 RAW SEQUENCE LISTING DATE: 03/23/94 

PATENT APPLICATION US/08/206,176 TIME: 12:14:01 

INPUT SET: S1765.raw 

409 115 120 125 

410 

411 Arg Val Ser Glu Asp Leu Arg Ser Arg lie Glu Val Leu Lys Arg Lys 

412 130 135 140 
413 

414 Val lie Glu Lys Val Gin His lie Gin Leu Leu Gin Lys Asn Val Arg 

415 145 150 155 160 
416 

417 Ala Gin Leu Val Asp Met Lys Arg Leu Glu Val Asp lie Asp lie Lys 

418 165 170 175 
419 

420 lie Arg Ser Cys Arg Gly Ser Cys Ser Arg Ala Leu Ala Arg Glu Val 

421 180 185 190 
422 

423 Asp Leu Lys Asp Tyr Glu Asp Gin Gin Lys Gin Leu Glu Gin Val lie 

424 195 200 205 
425 

426 Ala Lys Asp Leu Leu Pro Ser Arg Asp Arg Gin His Leu Pro Leu lie 

427 210 215 220 
428 

429 Lys Met Lys Pro Val Pro Asp Leu Val Pro Gly Asn Phe Lys Ser Gin 

430 225 230 235 240 
431 

432 Leu Gin Lys Val Pro Pro Glu Trp Lys Ala Leu Thr Asp Met Pro Gin 

433 245 250 255 
434 

435 Met Arg Met Glu Leu Glu Arg Pro Gly Gly Asn Glu lie Thr Arg Gly 

436 260 265 270 
437 

438 Gly Ser Thr Ser Tyr Gly Thr Gly Ser Glu Thr Glu Ser Pro Arg Asn 

439 275 280 285 
440 

441 Pro Ser Ser Ala Gly Ser Trp Asn Ser Gly Ser Ser Gly Pro Gly Ser 

442 290 295 300 
443 

444 Thr Gly Asn Arg Asn Pro Gly Ser Ser Gly Thr Gly Gly Thr Ala Thr 

445 305 310 315 . 320 
446 

447 Trp Lys Pro Gly Ser Ser Gly Pro Gly Ser Ala Gly Ser Trp Asn Ser 

448 325 330 335 
449 

450 Gly Ser Ser Gly Thr Gly Ser Thr Gly Asn Gin Asn Pro Gly Ser Pro 

451 340 345 350 
452 

453 Arg Pro Gly Ser Thr Gly Thr Trp Asn Pro Gly Ser Ser Glu Arg Gly 

454 355 360 365 
455 

456 Ser Ala Gly His Trp Thr Ser Glu Ser Ser Val Ser Gly Ser Thr Gly 

457 370 375 380 
458 

459 Gin Trp His Ser Glu Ser Gly Ser Phe Arg Pro Asp Ser Pro Gly Ser 



PAGE: 10 RAW SEQUENCE LISTING DATE: 03/23/94 

PATENT APPLICATION US/08/206,176 TIME: 12:14:06 

INPUT SET: S1765.raw 



460 385 390 395 400 

461 

462 Gly Asn Ala Arg Pro Asn Asn Pro Asp Trp Gly Thr Phe Glu Glu Val 

463 405 410 415 
464 

465 Ser Gly Asn Val Ser Pro Gly Thr Arg Arg Glu Tyr His Thr Glu Lys 

466 420 425 430 
467 

468 Leu Val Thr Ser Lys Gly Asp Lys Glu Leu Arg Thr Gly Lys Glu Lys 

469 435 440 445 
470 

471 Val Thr Ser Gly Ser Thr Thr Thr Thr Arg Arg Ser Cys Ser Lys Thr 

472 450 455 460 
473 

474 Val Thr Lys Thr Val lie Gly Pro Asp Gly His Lys Glu Val Thr Lys 

475 465 470 475 480 
476 

477 Glu Val Val Thr Ser Glu Asp Gly Ser Asp Cys Pro Glu Ala Met Asp 

478 485 490 495 
479 

480 Leu Gly Thr Leu Ser Gly lie Gly Thr Leu Asp Gly Phe Arg His Arg 

481 500 505 510 
482 

483 His Pro Asp Glu Ala Ala Phe Phe Asp Thr Ala Ser Thr Gly Lys Thr 

484 515 520 525 
485 

486 Phe Pro Gly Phe Phe Ser Pro Met Leu Gly Glu Phe Val Ser Glu Thr 

487 530 535 540 
488 

489 Glu Ser Arg Gly Ser Glu Ser Gly lie Phe Thr Asn Thr Lys Glu Ser 

490 545 550 555 560 
491 

492 Ser Ser His His Pro Gly lie Ala Glu Phe Pro Ser Arg Gly Lys Ser 

493 565 570 575 
494 

495 Ser Ser Tyr Ser Lys Gin Phe Thr Ser Ser Thr Ser Tyr Asn Arg Gly 

496 580 585 590 
497 

498 Asp Ser Thr Phe Glu Ser Lys Ser Tyr Lys Met Ala Asp Glu Ala Gly 

499 595 600 605 
500 

501 Ser Glu Ala Asp His Glu Gly Thr His Ser Thr Lys Arg Gly His Ala 

502 610 615 620 
503 

504 Lys Ser Arg Pro Val Arg Gly lie His Thr Ser Pro Leu Gly Lys Pro 

505 625 630 635 640 
506 

507 Ser Leu Ser Pro 

508 

509 

510 (2) INFORMATION FOR SEQ ID NO: 3: 



PAGE: 11 



511 
512 
513 
514 
515 
516 
517 
518 
519 
520 
521 
522 
523 
524 
525 
526 
527 
528 
529 
530 
531 
532 
533 
534 
535 
536 
537 
538 
539 
540 
541 
542 
543 
544 
545 
546 
547 
548 
549 
550 
551 
552 
553 
554 
555 
556 
557 
558 
559 
560 
561 



RAW SEQUENCE LISTING DATE: 03/23/94 

PATENT APPLICATION US/08/206,176 TIME: 12:14:12 

INPUT SET: S1765.raw 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8878 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: human fibrinogen B-beta chain 

(ix) FEATURE: 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 1..469 



(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 470.. 583 



(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 584.. 3257 



(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 3258.. 3449 



(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 3450.. 3938 



(ix) FEATURE: 

(A) NAME / KEY : exon 

(B) LOCATION: 3939.. 4122 



(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 4123.. 5042 



(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 5043.. 5270 



(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 5271.. 5830 



(ix) FEATURE: 

(A) NAME/KEY: exon 



PAGE: 12 RAW SEQUENCE LISTING DATE: 03/23/94 

PATENT APPLICATION US/08/206,176 TIME: 12:14:17 

INPUT SET: S1765.raw 



562 (B) LOCATION: 5831.. 5944 

563 

564 (ix) FEATURE: 

565 (A) NAME /KEY: intron 

566 (B) LOCATION: 5945.. 6632 
567 

568 (ix) FEATURE: 

569 (A) NAME/KEY: exon 

570 (B) LOCATION: 6633.. 6758 
571 

572 (ix) FEATURE: 

573 (A) NAME /KEY : intron 

574 (B) LOCATION: 6759.. 6966 
575 

576 (ix) FEATURE: 

577 (A) NAME/KEY: exon 

578 (B) LOCATION: 6967.. 7252 
579 

580 (ix) FEATURE: 

581 (A) NAME/KEY: intron 

582 (B) LOCATION: 7253.. 7870 
583 

584 

585 (ix) FEATURE: 

586 (A) NAME/KEY: exon 

587 (B) LOCATION: 7871.. 8102 
588 

589 (ix) FEATURE: 

590 (A) NAME /KEY : 3'UTR 

591 (B) LOCATION: 8103.. 8537 
592 

593 (ix) FEATURE: 

594 (A) NAME /KEY: misc_RNA 

595 (B) LOCATION: 8538.. 8878 
596 

597 (ix) FEATURE: 

598 (A) NAME/KEY: CDS 

599 (B) LOCATION: join (470 .. 583 , 3258.. 3449, 3939.. 4122, 5043.. 5270, 

600 5831.. 5944, 6633.. 6758, 6967.. 7252, 7871.. 8102) 
601 

602 

603 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
604 



605 GAATTCATGC CCCTTTTGAA ATAGACTTAT GTCATTGTCA GAAAACATAA GCATTTATGG 60 
606 

607 TATATCATTA ATGAGTCACG ATTTTAGTGG TTGCCTTGTG AGTAGGTCAA ATTTACTAAG 120 
608 

609 CTTAGATTTG TTTTCTCACA TATTCTTTCG GAGCTTGTGT AGTTTCCACA TTAATTTACC 180 
610 

611 AGAAACAAGA TACACACTCT CTTTGAGGAG TGCCCTAACT TCCCATCATT TTGTCCAATT 240 
612 



PAGE: 13 RAW SEQUENCE LISTING DATE: 03/23/94 

PATENT APPLICATION US/08/206,176 TIME: 12:14:23 

INPUT SET: S1765.raw 

613 AAATGAATTG AAGAAATTTA ATGTTTCTAA ACTAGACCAA CAAAGAATAA TAGTTGTATG 300 
614 

615 ACAAGTAAAT AAGCTTTGCT GGGAAGATGT TGCTTAAATG ATAAAATGGT TCAGCCAACA 360 
616 

617 AGTGAACCAA AAATTAAATA TTAACTAAGG AAAGGTAACC ATTTCTGAAG TCATTCCTAG 420 
618 

619 CAGAGGACTC AGATATATAT AGGATTGAAG ATCTCTCAGT TAAGTCTAC ATG AAA 475 

620 Met Lys 

621 1 
622 

623 AGG ATG GTT TCT TGG AGC TTC CAC AAA CTT AAA ACC ATG AAA CAT CTA 523 

624 Arg Met Val Ser Trp Ser Phe His Lys Leu Lys Thr Met Lys His Leu 

625 5 10 15 
626 

627 TTA TTG CTA CTA TTG TGT GTT TTT CTA GTT AAG TCC CAA GGT GTC AAC 571 

628 Leu Leu Leu Leu Leu Cys Val Phe Leu Val Lys Ser Gin Gly Val Asn 

629 20 25 30 
630 

631 GAC AAT GAG GAG GTGAATTTTT TAAAGCATTA TTATATTATT AGTAGTATTA 623 

632 Asp Asn Glu Glu 

633 35 
634 

635 TTAATATAAG ATGTAACATA ATCATATTAT GTGCTTATTT TAATGAAATT AGCATTGCTT 683 
636 

63 7 ATAGTTATGA AATGGAATTG TTAACCTCTG ACTTATTGTA TTTAAAGAAT GTTTCATAGT 743 
638 

63 9 ATTTCTTATA TAAAAACAAA GTAATTTCTT GTTTTCTAGT TTATCACCTT TGTTTTCTTA 803 
640 

641 AGATGAGGAT GGCTTAGCTA ATGTAAGATG TGTTTTTCTC ACTTGCTATT CTGAGTACTG 863 
642 

643 TGATTTTCAT TTACTTCTAG CAATACAGGA TTACAATTAA GAGGACAAGA TCTGAAAATC 923 
644 

645 TCACAAACTA TAAAATAATA AAAGAGCAGA ATTTTAAGAT AAAAGAAACT GGTGGTAGGT 983 
646 

647 AGATTGTTCT TTGGTGAAGG AAGGTAATAT ATATTGTTAC TGAGATTACT ATTTATAAAA 1043 
648 

649 ATTATAACTA AGCCTAAAAG CAAAATACAT CAAGTGTAAT GATAGAAAAT GAAATATTGC 1103 
650 

651 TTTTTTCAGA TGAAAAGTTC AAATTAGAGT TAGTGTGTAT TGTTATTATT AATAGTTATG 1163 
652 

653 AAACACGGTT CAGTCTAATT TATTTATTTG TAGAACAGTT TGTCCTCAAC TATTATTTTT 1223 
654 

655 GCTGACTTAT TGCTGTTAAT TTGCAGTTAC TAAAAATACA GAAATGCATT TAGGACAATG 1283 
656 

657 GATATTTAAG AAATTTAAAT TTTATCATCA AACGTATCAT GGCCAAATTT CTTACATATA 1343 
658 

659 GCATAGTATC ATTAAACTAG AAATAAGAAT ACACAATAAT ATTTAAATGA AGTGATTCAT 1403 
660 

661 TTCGGATCAT TATTGAGTTT CAAGGGAACT TGAGTGTTGT ACTTATCAGA CTCTACATGT 1463 
662 

663 AAGAACATAT AGTTAATCTG GTTGTGTGTG TAAAAACATA TGGTTAATCT GGTTAAGTCT 1523 



PAGE: 14 RAW SEQUENCE LISTING DATE: 03/23/94 

PATENT APPLICATION US/08/206,176 TIME: 12:14:30 

INPUT SET: S1765.raw 

664 

665 GGTTAATCAT ATTAGGTAAG AAAAATGTAA AGAATGTGTA AGACGAAATT TTTGTAAAGT 1583 
666 

667 ACTCTGCAAA GCACTTTCAC ATTTCTGCTT ATCAACTAAA CCTCACAGAG ATAGTTTAAT 1643 
668 

669 AGTTTAGGCT TTAAAATGGA TTTTGATTAT TCAACAAGTG GCCTTCATAA TTTCTTTAAG 1703 
670 

671 TGTTTTTCTT TAAGTATATA CTTTCTTTAA ATATTTTTTA AAATTTCCTT TTCTCTAGTA 1763 
672 

673 AAGCCAGACC ATCCATGCTA CCTCTCTAGT GGCACTCTGA AATAAAAAGA AAATAGTTTT 1823 
674 

675 CTCTGTTATA ATTGTATTTG TAATAAGCAG ATGAATCACA TTTCTTAAAA TTTGTTTTAG 1883 
676 

677 AGAGGGTAAG CTCTGACTAG GACCATGACT TCAATGTGAA ATATGTATAT ATCCTCCGAA 1943 
678 

679 TCTTTACATA TTAAGAATGT ATATAGTCAA CTGGTTAAAC AGGAAAATCT GGAACAGCCT 2003 
680 

681 GGCTGGGTTT TAATCTTAGC ACCATCCTAC TAAATGTTAA ATAATATTAT AATCTAATGA 2063 
682 

683 ATAAATGACA ATGCAATTCC AAATAGAGTT CATCTGATGA CTTCTAGACT CACAAAATTG 2123 
684 

685 CAAGAGAGCT CAGTTGTTGC TCAGTTGTTC CAAATCATGT CGTTTGTTAA TTTGTAATTA 2183 
686 

687 AGCTCCAAAG GATGTATAGC TACTGACAAA AAAAAAAATG AGAATGTAGT TAATCCAAAT 2243 
688 

689 CAAAACTTTC CTATTGCAAT GCGTATTTTC TGCTTCATTA TCCTTTAATA TAATATTTTA 2303 
690 

691 AGTTAGCAAG TAATTTTAAT TACAATGCAC AAGCCTTGAG AATTATTTTA AATATAAGAA 2363 
692 

693 AATCATAATG TTTGATAAAG AAATCATGTA AGAAATTTCA AGATAATGGT TTAACAAATA 2423 
694 

695 ATTTTGTTGA TAGAAGATAA GACTAAAAGT GAAATTCGAA GTGGAGAGGA CACTTAAACT 2483 
696 

697 GTAGTACTTG TTATGTGTGA TTCCAGTAAA AATAGTAATG AGCACTTATT ATTGCCAAGT 2543 
698 

699 ACTGTTCTGA GGGTACCATA TGCAATAAGT TATTTAATCC TTACAATAAT CTTGTAAGGC 2603 
700 

701 AGATTCAAAC TATCATTACA CTTATTTTAC AGATGAGAAA ACTGGGGCAC AGATAAAGCA 2663 
702 

703 ACTTGCCCAA GGTCTCATAG CTGTAAGTCA ACCCTACGGT CAAGACCTAC AAGTAGCCGA 2723 
704 

705 GCTCCAGAGT ACATTATGAG GGTCAAAGAT TGTCTTATTA CAAATAAATT CCAAGTAGAA 2783 
706 

707 TCAACCTTTA ATAAGTCTTT AATGTCTCTT AAATATGTTT ATATAGGAGT CTAATCACCA 2843 
708 

709 ATTCACAAAA ATGAAAGTAG GGAAATGATT AACAATAATC ATAGGAATCT AACAATCCAA 2 903 
710 

711 GTGGCTTGAG AATATTCATT CTTCTTGACA GTATAGATTC TTTACAATTT CGTAAGTTCC 2 963 
712 

713 AATGTATGTT TTAGGAATAT GAGGTCATTA CTATTCATAA TCTGATACAG CTTTATCCTA 3 023 
714 



PAGE: 15 RAW SEQUENCE LISTING DATE: 03/23/94 

PATENT APPLICATION US/08/206,176 TIME: 12:14:37 

INPUT SET: S1765.raw 



715 AGGCCTCTCT TTAAAAACTA CACTGCATCA TAGCTTTTTT GTGCAGTTGG TCTTTCTACT 3 083 
716 

717 GTTACTGAAC AGTAAGCAAC CTACAGATTC ACTATCACCA ACCAGCCAGT TGATGGATCT 3143 
718 

719 TAAGCAAATT ATCAAGCTTG TGATAACCTA AATTATAAAA TGAGGGTGTT GGAATAGTTA 3203 
720 

721 CATTCCAAAT CTTCTATAAC ACTCTGTATT ATATTTCTGC CTCATTCCTT GTAG GGT 3260 



722 Gly 

723 

724 

725 TTC TTC AGT GCC CGT GGT CAT CGA CCC CTT GAC AAG AAG AGA GAA GAG 3308 

726 Phe Phe Ser Ala Arg Gly His Arg Pro Leu Asp Lys Lys Arg Glu Glu 

727 40 45 50 55 
728 

729 GCT CCC AGC CTG AGG CCT GCC CCA CCG CCC ATC AGT GGA GGT GGC TAT 3356 

730 Ala Pro Ser Leu Arg Pro Ala Pro Pro Pro lie Ser Gly Gly Gly Tyr 

731 60 65 70 
732 

733 CGG GCT CGT CCA GCC AAA GCA GCT GCC ACT CAA AAG AAA GTA GAA AGA 3404 

734 Arg Ala Arg Pro Ala Lys Ala Ala Ala Thr Gin Lys Lys Val Glu Arg 

735 75 80 85 
736 

737 AAA GCC CCT GAT GCT GGA GGC TGT CTT CAC GCT GAC CCA GAC CTG 3449 

738 Lys Ala Pro Asp Ala Gly Gly Cys Leu His Ala Asp Pro Asp Leu 

739 90 95 100 



740 

741 GTGGGTGCAC TGATGTTTCT TGCAGTGGTG GCTCTCTCAT GCAGAGAAAG CCTGTAGTCA 3509 
742 

743 TGGCAGTCTG CTAATGTTTC ACTGACCCAC ATTACCATCA CTGTTATTTT GTTTGTTTAT 3569 
744 

745 TTTGGAAATA AAATTCAAAA CATAAACATA TTGGGCCTTT GGTTTAGGCT TTCTTTCTTG 362 9 
746 

747 TTTTCTTTGG TCTGGGCCCA AAATTTCAAA TTAGGATATG TGGGTGCCAC CTTTCCATTT 3689 
748 

74 9 GTATTTTGCC ACTGCCTTTG TTTAGTTGGT AAAATTTTCA TAGCCCAATT ATATTTTTTC 3 749 
750 

751 TGGGGTAAGT AATATTTTAA ATCTCTATGA GAGTATGATG ATGACTTTCG AATTTCTGGT 3809 
752 

753 CTTACAGAAA ACCAAATAAT AAATTTTTAT GTTGGCTAAT CGTATCGCTG AATTTTCCTA 3869 
754 

755 TGTGCTATTT TAACAAATGT CCATGACCCA AATCCTTCAT CTAATGCCTG CTATTTTCTT 3929 
756 

757 TGTTTTTAG GGG GTG TTG TGT CCT ACA GGA TGT CAG TTG CAA GAG GCT 3977 



758 Gly Val Leu Cys Pro Thr Gly Cys Gin Leu Gin Glu Ala 

759 105 110 115 
760 

761 TTG CTA CAA CAG GAA AGG CCA ATC AGA AAT AGT GTT GAT GAG TTA AAT 4025 

762 Leu Leu Gin Gin Glu Arg Pro lie Arg Asn Ser Val Asp Glu Leu Asn 

763 120 125 130 
764 

765 AAC AAT GTG GAA GCT GTT TCC CAG ACC TCC TCT TCT TCC TTT CAG TAC 4073 



PAGE: 16 RAW SEQUENCE LISTING DATE: 03/23/94 

PATENT APPLICATION US/08/206,176 TIME: 12:14:44 

INPUT SET: S1765.raw 

766 Asn Asn Val Glu Ala Val Ser Gin Thr Ser Ser Ser Ser Phe Gin Tyr 

767 135 140 145 
768 

769 ATG TAT TTG CTG AAA GAC CTG TGG CAA AAG AGG CAG AAG CAA GTA AAA G 4122 

770 Met Tyr Leu Leu Lys Asp Leu Trp Gin Lys Arg Gin Lys Gin Val Lys 

771 150 155 160 
772 

773 GTAGATATCC TTGTGCTTTC CATTCGATTT TCAGCTATAA AATTGGAACC GTTAGACTGC 4182 
774 

775 CACGAGAATG CATGGTTGTG AGAAGATTAA CATTTCTGGG TTAGTGAATA GCATTCATAC 4242 
776 

777 GCTTTTGGGC ACCTTCCCCT GCAACTTGCC AGATAAGCAC TATTCAGCTC TTATTCCCAG 4302 
778 

779 TCTGACATCA GCAAGTGTGA TTTTCTATGA AAAATTCTAC TATGACTCCT TATTTTAAGT 4362 
780 

781 ATACAAGAAA CTTGTGACTC AGAAGATAAT ATTTACAGAG TGGAAAAAAA CCCCTAGCAT 4422 
782 

783 TTATAGTTTT AACATTTGAG GTTTTGAATG AGAGAGTTAT CCATAATATA TTCAATTGTG 4482 
784 

785 TTGTGGATAA TGACACCTAA CCTGTGAATC TTGAGGTCAG AATGTTGAGT GCTGTTGACT 4542 
786 

787 TGGTGGTCAG GAAACAGCTA GTGCGTGAGC CTGGCACAGG CATCTCAGTG AGTAGCATAC 4602 
788 

789 CCACAGTTGG AAATTTTTCA AAGAAATCAA AGGAATCATG ACATCTTATA AATTTCAAGG 4662 
790 

791 TTCTGCTATA CTTATGTGAA ATGGATAAAT AAATCAAGCA TATCCACTCT GTAAGATTGA 4722 
792 

793 ACTTCTCAGA TGGAAGACCC CAATACTGCT TTCTCCTCTT TTCCCTCACC AAAGAAATAA 4782 
794 

795 ACAACCTATT TCATTTATTA CTGGACACAA TCTTTAGCGT ATACCTATGG TAAATTACTA 4842 
796 

797 GTATGGTGGT TAGGATTTAT GTTAATTTGT ATATGTCATG CGCCAAATCA TTTCCACTAA 4902 
798 

799 ATATGACTAT ATATCATAAC TGCTTGGTGA TAGCTCAGTG TTTAATAGTT TATTCTCAGA 4 962 
800 

801 AAATCAAAAT TGTATAGTTA AATACATTAG TTTTATGAGG CAAAAATGCT AACTATTTCT 5022 
802 

803 ACATAATTTC ATTTTTCCAG AT AAT GAA AAT GTA GTC AAT GAG TAC TCC 5071 

804 Asp Asn Glu Asn Val Val Asn Glu Tyr Ser 

805 165 170 
806 

807 TCA GAA CTG GAA AAG CAC CAA TTA TAT ATA GAT GAG ACT GTG AAT AGC 5119 

808 Ser Glu Leu Glu Lys His Gin Leu Tyr lie Asp Glu Thr Val Asn Ser 

809 175 180 185 
810 

811 AAT ATC CCA ACT AAC CTT CGT GTG CTT CGT TCA ATC CTG GAA AAC CTG 5167 

812 Asn lie Pro Thr Asn Leu Arg Val Leu Arg Ser lie Leu Glu Asn Leu 

813 190 195 200 205 
814 

815 AGA AGC AAA ATA CAA AAG TTA GAA TCT GAT GTC TCA GCT CAA ATG GAA 5215 

816 Arg Ser Lys lie Gin Lys Leu Glu Ser Asp Val Ser Ala Gin Met Glu 



PAGE: 17 RAW SEQUENCE LISTING DATE: 03/23/94 

PATENT APPLICATION US/08/206,176 TIME: 12:14:51 

INPUT SET; S1765.raw 

817 210 215 220 

818 

819 TAT TGT CGC ACC CCA TGC ACT GTC AGT TGC AAT ATT CCT GTG GTG TCT 5263 

820 Tyr Cys Arg Thr Pro Cys Thr Val Ser Cys Asn lie Pro Val Val Ser 

821 225 230 235 
822 

823 GGC AAA G GTAACTGATT CATAAACATA TTTTTAGAGA GTTCCAGAAG AACTCACACA 5320 

824 Gly Lys 
825 

826 

827 CCAAAAATAA GAGAACAACA ACAACAACAA AAATGCTAAG TGGATTTTCC CAACAGATCA 5380 
828 

829 TAATGACATT ACAGTACATC ATAAAAATAT CCTTAGCCAG TTGTGTTTTG GACTGGCCTG 5440 
830 

831 GTGCATTTGC TGGTTTTGAT GAGCAGGATG GGGCACAGGT AGTCCCAGGG GTGGCTGATG 5500 
832 

833 TGTGCATCTG CGTACTGGCT TGAACAGATG GCAGAACCAC AGATAGATGT AGAAGTTTCT 5560 
834 

835 CCATTTTGTG TGTTCTGGGA GCTCATGGAT ATTCCAGGAC ACAAAAGGTG GAGAAGAGCT 5620 
836 

837 TTGTTCATCC TCTTAGCAGA TAAACGTCCT CAAAACTGGG TTGGACTTAC TAAAGTAAAA 5680 
838 

839 TGAAAATCTA ATATTTGTTA TATTATTTTC AAAGGTCTAT AATAACACAC TCCTTAGTAA 5740 
840 

841 CTTATGTAAT GTTATTTTAA AGAATTGGTG ACTAAATACA AAGTAATTAT GTCATAAACC 5800 
842 

843 CCTGAACATA ATGTTGTCTT ACATTTGCAG AA TGT GAG GAA ATT ATC AGG AAA 5853 

844 Glu Cys Glu Glu lie lie Arg Lys 

845 240 245 
846 

847 GGA GGT GAA ACA TCT GAA ATG TAT CTC ATT CAA CCT GAC AGT TCT GTC 5901 

848 Gly Gly Glu Thr Ser Glu Met Tyr Leu lie Gin Pro Asp Ser Ser Val 

849 250 255 260 
850 

851 AAA CCG TAT AGA GTA TAC TGT GAC ATG AAT ACA GAA AAT GGA G 5944 

852 Lys Pro Tyr Arg Val Tyr Cys Asp Met Asn Thr Glu Asn Gly 

853 265 270 275 
854 

855 GTAAGCTTTC GACAGTTGTT GACCTGTTGA TCTGTAATTA TTTGGATACC GTAAAATGCC 6 004 
856 

857 AGGAAACAAG GCCAGGTGTG GTGGCTCATA CCTGTAATTC CAGCACCTTG GGAGGCCAAA 6064 
858 

859 GTGGGCTGAT AGCTTGAGCC TAGGAGTTTG AAACTAGCCT GGGCAACATA ATGAGACCCT 6124 
860 

861 AACTCTACAA AAAAAAAAAA AAT AC C AAAA AAAAAAAAAA AATCAGCTGT GTTGGTAGTA 6184 
862 

863 TGTGCCTGTA GTCCCAGCTA TCCAGGAGGC TGAGATGGGA GATCACCTGA GCCCACAACC 6244 
864 

865 TGGAGTCTTG ATCATGCTAC TGAACTGTAG CCTGGGCAAC AGAGGATAGT GAGATCCTGT 6304 
866 

867 CTCAAAAAAA AAAATTAATT AAAAAGCCAG GAAACAAGAC TTAGCTCTAA CATCTAACAT 6364 



PAGE: 18 RAW SEQUENCE LISTING DATE: 03/23/94 

PATENT APPLICATION US/08/206,176 TIME: 12:14:57 

INPUT SET: S1765.raw 



868 




































869 


AGCTGACAAA GGAGTAATTT GATGTGGAAT TCAACCTGAT 


ATTTAAAAGT TATAAAATAT 


/~ A A 

6424 


O/O 

871 


CTATAATTCA CAATTTGGGG TAAGATAAAG CACTTGCAGT 


TTCCAAAGAT TTTACAAGTT 


£ A Q A 


872 
873 


TACCTCTCAT ATTTATTTCC TTATTGTGTC TATTTTAGAG 


CACCAAATAT ATACTAAATG 


6544 


874 




































875 


GAATGGACAG GGGATTCAGA TATTATTTTC AAAGTGACAT 


TATTTGCTGT TGGTTAATAT 


ooU4 


876 




































877 


ATGCTCTTTT TGTTTCTGTC AACCAAAG 


GA TGG ACA GTG ATT CAG AAC CGT 


6655 


878 
















Gly Trp Thr Val He Gin Asn Arg 




879 




















280 








285 




880 




































881 


CAA 


GAC 


GGT 


AGT 


GTT 


GAC 


TTT 


GGC 


AGG 


AAA 


TGG 


GAT 


CCA 


TAT 


AAA 


CAG 


6703 


882 


Gin Asp 


Gly Ser Val Asp 


Phe 


Gly Arg Lys 


Trp 


Asp 


Pro 


Tyr 


Lys 


Gin 




883 










290 










295 










300 






884 




































885 


GGA 


TTT 


GGA 


AAT 


GTT 


GCA 


ACC 


AAC 


ACA 


GAT 


GGG 


AAG 


AAT 


TAC 


TGT 


GGC 


6751 


o o c 

886 


Gly Phe 


Gly Asn Val 


Ala 


Thr 


Asn Thr Asp Gly 


Lys 


Asn 


Tyr 


Cys 


Gly 




887 








305 










310 










315 








888 




































889 


CTA 


CCA 


G GTAACGAACA GGCATGCAAA ATAAAATCAT 


TCTATTTGAA ATGGGATTTT 


6808 


890 


Leu 


Pro 
































891 




































892 




































893 


TTTTAATTAA AAAACATTCA TTGTTGGAAG CCTGTTTTAG 


GCAGTTAAGA GGAGTTTCCT 


6868 


894 
895 


GACAAAAATG TGGAAGCTAA AGATAAGGGA AGAAAGGCAG 


TTTTTAGTTT CCCAAAATTT 


6928 


896 




































897 


TATTTTTGGT GAGAGATTTT ATTTTGTTTT TCTTTTAG GT GAA TAT TGG CTT 


6980 


898 






















Gly Glu Tyr Trp Leu 




899 






















320 












900 




































901 


GGA 


AAT 


GAT 


AAA 


ATT 


AGC 


CAG 


CTT 


ACC 


AGG 


ATG 


GGA 


CCC 


ACA 


GAA 


CTT 


7028 


902 


Gly 


Asn 


Asp 


Lys 


He 


Ser 


Gin 


Leu 


Thr 


Arg 


Met 


Gly 


Pro 


Thr 


Glu 


Leu 




903 


325 










330 










335 










340 




904 




































905 


TTG 


ATA 


GAA 


ATG 


GAG 


GAC 


TGG 


AAA 


GGA 


GAC 


AAA 


GTA 


AAG 


GCT 


CAC 


TAT 


7076 


906 


Leu 


He 


Glu 


Met 


Glu 


Asp 


Trp 


Lys 


Gly 


Asp 


Lys 


Val 


Lys 


Ala 


His 


Tyr 




907 










345 










350 










355 






908 




































909 


GGA 


GGA 


TTC 


ACT 


GTA 


CAG 


AAT 


GAA 


GCC 


AAC 


AAA 


TAC 


CAG 


ATC 


TCA 


GTG 


7124 


910 


Gly 


Gly 


Phe 


Thr 


Val 


Gin 


Asn 


Glu 


Ala 


Asn 


Lys 


Tyr 


Gin 


He 


Ser 


val 




911 








360 










365 










370 








912 




































913 


AAC 


AAA 


TAC 


AGA 


GGA 


ACA 


GCC 


GGT 


AAT 


GCC 


CTC 


ATG 


GAT 


GGA 


GCA 


TCT 


7172 


914 


Asn 


Lys 


Tyr 


Arg 


Gly 


Thr 


Ala 


Gly 


Asn 


Ala 


Leu 


Met 


Asp 


Gly 


Ala 


Ser 




915 






375 










380 










385 










916 




































917 


CAG 


CTG 


ATG 


GGA 


GAA 


AAC 


AGG 


ACC 


ATG 


ACC 


ATT 


CAC 


AAC 


GGC 


ATG 


TTC 


7220 


918 


Gin 


Leu 


Met 


Gly 


Glu 


Asn 


Arg 


Thr 


Met 


Thr 


He 


His 


Asn 


Gly 


Met 


Phe 





PAGE: 19 RAW SEQUENCE LISTING DATE: 03/23/94 

PATENT APPLICATION US/08/206,176 TIME: 12:15:04 

INPUT SET: S1765.raw 



919 390 395 400 

920 

921 TTC AGC ACG TAT GAC AGA GAC AAT GAC GGC TG GTATGTGTGG 7262 

922 Phe Ser Thr Tyr Asp Arg Asp Asn Asp Gly Trp 

923 405 410 415 
924 

925 CACTCTTTGC TCCTGCTTTA AAAATCACAC TAATATCATT ACTCAGAATC ATTAACAATA 7322 
926 

927 TTTTTAATAG CTACCACTTC CTGGGCACTT ACTGTCAGCC ACTGTCCTAA GCTCTTTATG 7382 
928 

929 CATCACTCGA AAGCATTTCA ACTATAAGGT AGACATTCTT ATTCTCATTT TACAGATGAG 7442 
930 

931 ATTTAGAGAG ATTACGTGAT TTGTCCAATG TCACACAACT ACCCAGAGAT AAAACTAGAA 7502 
932 

933 TTTGAGCACA GTTACTTTCT GAATAATGAG CATTTAGATA AATAC CTAT A TCTCTATATT 7562 
934 

935 CTAAAGTGTG TGTGAAAACT TTCATTTTCA TTTCCAGGGT TCTCTGATAC TAAGGGTTGT 7622 
936 

937 AAAAGCTATT ATTCCAGTAT AAAGTAACAA ACACAGTCCC TAGATGGATT GCCACAAAGG 7682 
938 

93 9 CCCAGTTATC TCTCTTTCTT GCTATAGGGC ACAGGAGGTC TTTGGTGTAT TAGTGTGACT 7742 
940 

941 CTATGTATAG CACCCAAAGG AAAGACTACT GTGCACACGA GTGTAGCAGT CTTTTATGGG 7802 
942 

943 TAATCTGCAA AACGTAACTT GACCACCGTA GTTCTGTTTC TAATAACGCC AAACACATTT 7862 
944 

945 TCTTTCAG G TTA ACA TCA GAT CCC AGA AAA CAG TGT TCT AAA GAA GAC 7910 

946 Leu Thr Ser Asp Pro Arg Lys Gin Cys Ser Lys Glu Asp 

947 420 425 
948 

949 GGT GGT GGA TGG TGG TAT AAT 

950 Gly Gly Gly Trp Trp Tyr Asn 

951 430 435 
952 

953 AGA TAC TAC TGG GGT GGA CAG 

954 Arg Tyr Tyr Trp Gly Gly Gin 

955 445 450 
956 

957 ACA GAT GAT GGT GTA GTA TGG 

958 Thr Asp Asp Gly Val Val Trp 

959 465 
960 

961 ATG AGG AAG ATG AGT ATG AAG 

962 Met Arg Lys Met Ser Met Lys 

963 480 
964 

965 TACGTAGATT TTTGCTCTTC TGTATGTGAC AACATTTTTG TACATTATGT TATTGGAATT 8169 
966 

967 TTCTTTCATA CATTATATTC CTCTAAAACT CTCAAGCAGA CGTGAGTGTG ACTTTTTGAA 8229 
968 

969 AAAAGTATAG GATAAATTAC ATTAAAATAG CACATGATTT TCTTTTGTTT TCTTCATTTC 8289 



AGA TGT CAT GCA GCC AAT CCA AAC GGC 7958 
Arg Cys His Ala Ala Asn Pro Asn Gly 
440 

TAC ACC TGG GAC ATG GCA AAG CAT GGC 8006 
Tyr Thr Trp Asp Met Ala Lys His Gly 
455 460 

ATG AAT TGG AAG GGG TCA TGG TAC TCA 8054 
Met Asn Trp Lys Gly Ser Trp Tyr Ser 
470 475 

ATC AGG CCC TTC TTC CCA CAG CAA TAGTCCCCAA 8109 
lie Arg Pro Phe Phe Pro Gin Gin 
485 490 



PAGE: 20 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/206,176 



DATE: 03/23/94 
TIME: 12:15:11 



INPUT SET; S1765.raw 



970 
971 
972 
973 
974 
975 
976 
977 
978 
979 
980 
981 
982 
983 
984 
985 
986 
987 
988 
989 
990 
991 
992 
993 
994 
995 
996 
997 
998 
999 
1000 
1001 
1002 
1003 
1004 
1005 
1006 
1007 
1008 
1009 
1010 
1011 
1012 
1013 
1014 
1015 
1016 
1017 
1018 
1019 
1020 



TCTTGCTCAC 
AGTTCTAGTT 
GGAAAACCAT 
ATTTATGTAG 
GTTGCAATAA 
CACTTGTAAG 
TTTGAAGATT 
ACTTTGGGAG 
ATGGTGAAAC 
TGTAGGTCCC 



CCAAGAAGTA 
GATTGCGAGA 
GACGGAAAGG 
GATCTGTCAA 
GTTAATGTTT 
GAAGGAGAAG 
TAAAATCTGA 
GCTGAGGCGG 
CCCATCTCTA 
AGCTAGCCTG 



ACAAAAGTAT 
ATTTTCAAAT 
AAAAACTGAT 
AGAAAACTTC 
TCTTGTTTTG 
CGTTCACAAC 
CTCTAGGACG 
GCGGTCACAA 
CTAAAAATAC 
TGAGGTGGAG 



AGTTTTGACA 
AAGGAAGAGG 
GTTTAAAAGT 
CAAAAAGATT 
TAATCCACAC 
CTCAAATAGC 
GGCACGGTGG 
GGTCAGGAGT 
AAAAATTAGC 
ATTGCATTGA 



GAGTTGGTGT 
GGTCTTTTAT 
CCACTTTTAA 
TATTAATTAA 
ATTCAATGAG 
TAATAAACCG 
CTCACGACTA 
TCAAGACCAG 
CAGGCGTGGT 
GCCAAGATC 



TCATAATTTC 
CCTTGTCGTA 
AACTATATTT 
ACCAGACTCT 
TTAGGCTTTG 
GTCTTGAATA 
TAATCCCAAC 
CCTGACCAAT 
GGCAGGTGCC 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Lys Arg Met Val Ser Trp Ser Phe His Lys Leu 
15 10 

His Leu Leu Leu Leu Leu Leu Cys Val Phe Leu Val 
20 25 

Val Asn Asp Asn Glu Glu Gly Phe Phe Ser Ala Arg 
35 40 



Lys Thr Met Lys 
15 

Lys Ser Gin Gly 
30 

Gly His Arg Pro 
45 



Leu Asp Lys Lys Arg Glu Glu Ala Pro Ser Leu Arg Pro Ala Pro Pro 
50 55 60 



Pro lie Ser Gly Gly Gly Tyr Arg Ala Arg Pro Ala 
65 70 75 

Thr Gin Lys Lys Val Glu Arg Lys Ala Pro Asp Ala 
85 90 



Lys Ala Ala Ala 
80 

Gly Gly Cys Leu 
95 



8349 
8409 
8469 
8529 
8589 
8649 
8709 
8769 
8829 
8878 



# 



PAGE: 21 RAW SEQUENCE LISTING DATE: 03/23/94 

PATENT APPLICATION US/08/206,176 TIME: 12:15:18 

INPUT SET: S1765.raw 



1021 His Ala Asp Pro Asp Leu Gly Val Leu Cys Pro Thr Gly Cys Gin Leu 

1022 100 105 110 
1023 

1024 Gin Glu Ala Leu Leu Gin Gin Glu Arg Pro lie Arg Asn Ser Val Asp 

1025 115 120 125 
1026 

1027 Glu Leu Asn Asn Asn Val Glu Ala Val Ser Gin Thr Ser Ser Ser Ser 

1028 130 135 140 
1029 

1030 Phe Gin Tyr Met Tyr Leu Leu Lys Asp Leu Trp Gin Lys Arg Gin Lys 

1031 145 150 155 160 
1032 

1033 

1034 Gin Val Lys Asp Asn Glu Asn Val Val Asn Glu Tyr Ser Ser Glu Leu 

1035 165 170 175 
1036 

103 7 Glu Lys His Gin Leu Tyr lie Asp Glu Thr Val Asn Ser Asn lie Pro 

1038 180 185 190 

1039 

1040 Thr Asn Leu Arg Val Leu Arg Ser lie Leu Glu Asn Leu Arg Ser Lys 

1041 195 200 205 
1042 

1043 lie Gin Lys Leu Glu Ser Asp Val Ser Ala Gin Met Glu Tyr Cys Arg 

1044 210 215 220 
1045 

1046 Thr Pro Cys Thr Val Ser Cys Asn lie Pro Val Val Ser Gly Lys Glu 

1047 225 230 235 240 
1048 

1049 Cys Glu Glu lie lie Arg Lys Gly Gly Glu Thr Ser Glu Met Tyr Leu 

1050 245 250 255 
1051 

1052 lie Gin Pro Asp Ser Ser Val Lys Pro Tyr Arg Val Tyr Cys Asp Met 

1053 260 265 270 
1054 

1055 Asn Thr Glu Asn Gly Gly Trp Thr Val lie Gin Asn Arg Gin Asp Gly 

1056 275 280 285 
1057 

1058 Ser Val Asp Phe Gly Arg Lys Trp Asp Pro Tyr Lys Gin Gly Phe Gly 

1059 290 295 300 
1060 

1061 Asn Val Ala Thr Asn Thr Asp Gly Lys Asn Tyr Cys Gly Leu Pro Gly 

1062 305 310 315 320 
1063 

1064 Glu Tyr Trp Leu Gly Asn Asp Lys lie Ser Gin Leu Thr Arg Met Gly 

1065 325 330 335 
1066 

1067 Pro Thr Glu Leu Leu lie Glu Met Glu Asp Trp Lys Gly Asp Lys Val 

1068 340 345 350 
1069 

1070 Lys Ala His Tyr Gly Gly Phe Thr Val Gin Asn Glu Ala Asn Lys Tyr 

1071 355 360 365 



PAGE: 22 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/206,176 



DATE: 03/23/94 
TIME: 12:15:25 



INPUT SET: S1765.raw 



1072 
1073 
1074 
1075 
1076 
1077 
1078 
1079 
1080 
1081 
1082 
1083 
1084 
1085 
1086 
1087 
1088 
1089 
1090 
1091 
1092 
1093 
1094 
1095 
1096 
1097 
1098 
1099 
1100 
1101 
1102 
1103 
1104 
1105 
1106 
1107 
1108 
1109 
1110 
1111 
1112 
1113 
1114 
1115 
1116 
1117 
1118 
1119 
1120 
1121 
1122 



Gin lie Ser 
370 

Asp Gly Ala 
385 

Asn Gly Met 



Thr Ser Asp 



Trp Tyr Asn 
435 

Gly Gly Gin 
450 

Val Val Trp 
465 



Val Asn Lys Tyr 
375 

Ser Gin Leu Met 
390 

Phe Phe Ser Thr 
405 

Pro Arg Lys Gin 
420 

Arg Cys His Ala 



Arg Gly 
Gly Glu 
Tyr Asp 



Cys Ser 
425 

Ala Asn 
440 



Tyr Thr Trp Asp Met Ala 
455 

Met Asn Trp Lys Gly Ser 
470 



Ser Met Lys lie Arg Pro Phe Phe Pro 
485 



Thr Ala Gly Asn 
380 

Asn Arg Thr Met 
395 

Arg Asp Asn Asp 
410 

Lys Glu Asp Gly 



Pro Asn Gly Arg 
445 

Lys His Gly Thr 
460 

Trp Tyr Ser Met 
475 

Gin Gin 
490 



Ala Leu Met 

Thr He His 
400 

Gly Trp Leu 
415 

Gly Gly Trp 
430 

Tyr Tyr Trp 
Asp Asp Gly 



Arg Lys Met 
480 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: human fibrinogen gamma chain 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: j oin (1799 1876 , 1973.. 2017, 2207.. 2390, 2510 

..2603, 4211.. 4341, 4645.. 4778, 5758.. 5942, 7426 
..7703, 9342.. 9571) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CTACACACTT CTTGAAGGCA AAGGCAATGC TGAAGTCACC TTTCATGTTC AAATCATATT 



60 



PAGE: 23 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/206,176 



DATE: 03/23/94 
TIME: 12:15:32 



INPUT SET: S1765.raw 



1123 
1124 
1125 
1126 
1127 
1128 
1129 
1130 
1131 
1132 
1133 
1134 
1135 
1136 
1137 
1138 
1139 
1140 
1141 
1142 
1143 
1144 
1145 
1146 
1147 
1148 
1149 
1150 
1151 
1152 
1153 
1154 
1155 
1156 
1157 
1158 
1159 
1160 
1161 
1162 
1163 
1164 
1165 
1166 
1167 
1168 
1169 
1170 
1171 
1172 
1173 



AAAAAGTTAG 
TTACATATTT 
ATATAGAAAG 
CATCCCATTC 
ACTACAACTA 
ACAATCTAAT 
TCTGAGGTAG 
TTGACAACTG 
TGGTGGTATG 
GAGAAGAAAA 
TACTGAATAT 
AATTAAAGTA 
AAAACTTATA 
TTAATAATAG 
CCCAAACACA 
TTTTGCCCCT 
TCTCTTCTCT 
TTCTCTGTGG 
AGCTCCCTGA 
CCAGGGTCAT 
GCCACAGCCA 
TAACTGATCC 
TCTTAAGGAT 
ACTGGAAGAA 
ATGCTTCACT 
AAAGGAGGAG 



CAAGATGTAA 
TCATTATATA 
TTTACAGAGA 
TTCTTCACAG 
TGTTAAGTAC 
GCAGTGGATA 
AGCTGTTTGA 
ATTAGCTGAG 
GTGATATGAT 
GCAACAAATC 
GAATATTTAT 
TTAGTATTTC 
ATAAGCTTGG 
TGTAACGCCC 
AGGATGCTGT 
TCCTTCAGCT 
GAGATAGCTT 
CAGATGGGGT 
GAAGTGAGAG 
TCATTCCTGG 
CAGATTAAAA 
CTAAGCTAAG 
AGACTAGGTT 
GTTGCATCAC 
AGCCTTTGCA 
CTTCAACCTG 



TTATCAGTGT 
TATTTTAGTA 
AAAATAAAGC 
AGGCAACTGA 
AGCTATATAT 
CAGCATGTAT 
AAGAATACTA 
TGGAAAAGAT 
TGACAATAAC 
TGGTTTTGAT 
CTTTGTTTTT 
TTTCAAACTG 
TTCCAGAGGA 
AGTGATAAAT 
AAGGGC CAAA 
ATGCCTCTAC 
GATGGAACCA 
GCCATGCCCA 
CCTATGAACA 
GAATAGTGAA 
ATAATAATGT 
AAAGTTCTTT 
TGCTTAGTTC 
ACAGCCTCCA 
GATAATTTTG 
TGTGCAAAAT 



ACTATGTAAA 
GATAATATTT 
CTTTTTTTCC 
TTCAAGTCAT 
GTTAGATGCC 
ACATATAATA 
ATACTTAAAT 
GGATGAGAAA 
TGCTAAGTCA 
GTGTTCACTT 
GAGTCAATAA 
GAGGCATTTC 
AGAAATGAGG 
CTCAATAGGC 
CAGAAATGAT 
TCTCCTTTAG 
CAGGAACAAT 
CCTTCAGACA 
TGGTTGACAC 
CTGGGACATG 
TAACTGATCC 
TGGTAATTCA 
GAGGTCATAT 
GGACTGCCCT 
GATCAGAGAA 
CTGGGAACCT 



TCTTTGTGAA 
ATATACATTC 
AATCCTGTCC 
TACATAGTTA 
GTAGCCACAG 
TAAGGTTGCT 
GTTTAATTCA 
GATTGTGAGA 
GAGAGGGATA 
TGTTATAATT 
ATATACCTTT 
TCCCACTAAC 
GATAACCAAA 
AGTGATGACA 
GGCCCCTCCC 
ATACAAGGGA 
GAAGTGGGCT 
AAGGGAAGAT 
AGAGGGACAG 
GGGGAAGTCA 
CTAGGCTAAA 
GGTGATGGCA 
CTGTTTGCTC 
CCTCCTCACA 
AAAACCTTGA 
GACAGTATAG 



TGATCAATAA 
AACATTCTAA 
TCCACCTCTG 
TTGAGTGTTA 
AAATCAGTTT 
ACAAATGCTA 
ACTGACTTGA 
CTTAATTGGC 
TATTAAGGAG 
ATTGATTATT 
GTAAAGACAG 
ATATTTCATC 
AATAGAGACA 
GACATGTTTT 
CAGCACCTCA 
GGTGGATTTT 
CCTGGCTCTT 
TGAGCTCAAA 
GAATGTATTT 
GTCTCCTCCT 
ATAATAGTGT 
GCAGGACCCA 
TCAGCCATGT 
GCAATGGATA 
GCTGGGCCAA 
GTTGGGGGCC 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



PAGE: 24 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/206,176 



DATE: 03/23/94 
TIME: 12:15:38 



INPUT SET: SI 765. raw 



1174 
1175 
1176 
1177 
1178 
1179 
1180 
1181 
1182 
1183 
1184 
1185 
1186 
1187 
1188 
1189 
1190 
1191 
1192 
1193 
1194 
1195 
1196 
1197 
1198 
1199 
1200 
1201 
1202 
1203 
1204 
1205 
1206 
1207 
1208 
1209 
1210 
1211 
1212 
1213 
1214 
1215 
1216 
1217 
1218 
1219 
1220 
1221 
1222 
1223 
1224 



AGGATGAGGA AAAAGGAACG GGAAAGACCT GCCCACCCTT CTGGTAAGGA GGCCCCGTGA 1680 

TCAGCTCCAG CCATTTGCAG TCCTGGCTAT CCCAGGAGCT TACATAAAGG GACAATTGGA 1740 

GCCTGAGAGG TGACAGTGCT GACACTACAA GGCTCGGAGC TCCGGGCACT CAGACATC 1798 

ATG AGT TGG TCC TTG CAC CCC CGG AAT TTA ATT CTC TAC TTC TAT GCT 1846 
Met Ser Trp Ser Leu His Pro Arg Asn Leu lie Leu Tyr Phe Tyr Ala 
15 10 15 

CTT TTA TTT CTC TCT TCA ACA TGT GTA GCA GTAAGTGTGC TCTTCACAAA 1896 
Leu Leu Phe Leu Ser Ser Thr Cys Val Ala 
20 25 

ACGTTGTTTA AAATGGAAAG CTGGAAAATA AAACAGATAA TAAACTAGTG AAATTTTCGT 1956 

ATTTTTTCTC TTTTAG TAT GTT GCT ACC AGA GAC AAC TGC TGC ATC TTA 2 005 

Tyr Val Ala Thr Arg Asp Asn Cys Cys lie Leu 
30 35 

GAT GAA AGA TTC GTAAGTAGTT TTTATGTTTC TCCCTTTGTG TGTGAACTGG 2057 
Asp Glu Arg Phe 
40 

AGAGGGGCAG AGGAATAGAA ATAATTCCCT CATAAATATC ATCTGGCACT TGTAACTTTT 2117 

TAAAAACATA GTCTAGGTTT TACCTATTTT TCTTAATAGA TTTTAAGAGT AGCATCTGTC 2177 

TACATTTTTA ATCACTGTTA TATTTTCAG GGT AGT TAT TGT CCA ACT ACC TGT 223 0 

Gly Ser Tyr Cys Pro Thr Thr Cys 
45 

GGC ATT GCA GAT TTC CTG TCT ACT TAT CAA ACC AAA GTA GAC AAG GAT 2278 
Gly lie Ala Asp Phe Leu Ser Thr Tyr Gin Thr Lys Val Asp Lys Asp 
50 55 60 65 

CTA CAG TCT TTG GAA GAC ATC TTA CAT CAA GTT GAA AAC AAA ACA TCA 2326 
Leu Gin Ser Leu Glu Asp lie Leu His Gin Val Glu Asn Lys Thr Ser 
70 75 80 

GAA GTC AAA CAG CTG ATA AAA GCA ATC CAA CTC ACT TAT AAT CCT GAT 2374 
Glu Val Lys Gin Leu lie Lys Ala lie Gin Leu Thr Tyr Asn Pro Asp 
85 90 95 

GAA TCA TCA AAA CCA A GTGAGAAAAT AAAGACTACT GACCAAAAAA 2420 
Glu Ser Ser Lys Pro 
100 

TAATAATAAT AATCTGTGAA GTTCTTTTGC TGTTGTTTTA GTTGTTCTAT TTGCTTAAGG 2480 
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1225 
1226 
1227 
1228 
1229 
1230 
1231 
1232 
1233 
1234 
1235 
1236 
1237 
1238 
1239 
1240 
1241 
1242 
1243 
1244 
1245 
1246 
1247 
1248 
1249 
1250 
1251 
1252 
1253 
1254 
1255 
1256 
1257 
1258 
1259 
1260 
1261 
1262 
1263 
1264 
1265 
1266 
1267 
1268 
1269 
1270 
1271 
1272 
1273 
1274 
1275 



ATTTTTATGT CTCTGATCCT ATATTACAG AT ATG ATA 

Asn Met lie 
105 

AAG TCC AGG ATA ATG TTA GAA GAA ATT ATG AAA 
Lys Ser Arg lie Met Leu Glu Glu lie Met Lys 
115 120 



GAC GCT GCT ACT TTG 
Asp Ala Ala Thr Leu 
110 

TAT GAA GCA TCG ATT 
Tyr Glu Ala Ser lie 
125 



TTA ACA CAT GAC TCA AGT ATT CG GTAAGGATTT TTGTTTTAAT TTGCTCTGCA 
Leu Thr His Asp Ser Ser lie Arg 
130 

AGACTGATTT AGTTTTTATT TAATATTCTA TACTTGAGTG AAAGTAATTT TTAATGTGTT 
TTCCCCATTT ATAATATCCC AGTGACATTA TGCCTGATTA TGTTGAGCAT AGTAGAGATA 
GAAGTTTTTA GTGCAATATA AATTATACTG GGTTATAATT GCTTATTAAT AATCACATTG 
AAGAAAGATG TTCTAGATGT CTTCAAATGC TAGTTTGACC ATATTTATCA AAAATTTTTT 
CCCCATCCCC CATTTATCTT ACAACATAAA ATCAATCTCA TAGGAATTTG GGTGTTGAAA 
ATAAAATCCT CTTTATAAAA ATGCTGACAA ATTGGTGGTT AAAAAAATTA GCAAGCAGAG 
GCATAGTAAG GATTTTGGCT CCTAAAGTAA ATTATATTGA ATGTGGAGCA GGAAGAAACA 
TGTCTTGAGA GACTAAGTGT GGCAAATATT GCAAAGCTCA TATTGATCAT TGCAGAATGA 
ACCTGCATAG TCTCTTCCCT TCATTTGGAA GTGAATGTCT CTGTTAAAGC TTCTCAGGGA 
CTCATAAACT TTCTGAACAT AAGGTCTCAG ATACAGTTTT AATATTTTTC CCCAATTTTT 
TTTTCTGAAT TTTTCTCAAA GCAGCTTGAG AAATTGAGAT AAATAGTAGC TAGGGAGAAG 
TGGCCCAGGA AAGATTTCTC CTCTTTTTGC TATCAGAGGG CCCTTGTTAT TATTGTTATT 
ATTATTACTT GCATTATTAT TGTCCATCAT TGAAGTTGAA GGAGGTTATT GTACAGAAAT 
TGCCTAAGAC AAGGTAGAGG GAAAACGTGG ACAAATAGTT TGTCTACCCT TTTTTACTTC 
AAAGAAAGAA CGGTTTATGC ATTGTAGACA GTTTTCTATC ATTTTTGGAT ATTTGCAAGC 
CACCCTGTAA GTAACTACAA AAGGAGGGTT TTTACTTCCC CCAGTCCATT CCCAAAGCTA 
TGTAACCAGA AGCATTAAAG AAGAAAGGGG AAGTATCTGT TGTTTTATTT TACATACAAT 
AACGTTCCAG ATCATGTCCC TGTGTAAGTT ATATTTTAGA TTGAAGCTTA TATGTATAGC 
CTCAGTAGAT CCACAAGTGA AAGGTATACT CCTTCAGCAC ATGTGAATTA CTGAACTGAG 
CTTTTCCTGC TTCTAAAGCA TCAGGGGGTG TTCCTATTAA CCAGTCTCGC CACTCTTGCA 



2532 

2580 

2633 

2693 
2753 
2813 
2873 
2933 
2993 
3053 
3113 
3173 
3233 
3293 
3353 
3413 
3473 
3533 
3593 
3653 
3713 
3773 
3833 
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1276 

1277 GGTTGCTATC TGCTGTCCCT TATGCATAAA GTAAAAAGCA AAATGTCAAT GACATTTGCT 38 93 
1278 

1279 TATTGACAAG GACTTTGTTA TTTGTGTTGG GAGTTGAGAC AATATGCCCC ATTCTAAGTA 3 953 
1280 

1281 AAAAGATTCA GGTCCACATT GTATTCCTGT TTTAATTGAT TTTTTGATTT GTTTTTCTTT 4013 
1282 

1283 TTCAAAAAGT TTATAATTTT AATTCATGTT AATTTAGTAA TATAATTTTA CATTTTCCTC 4073 
1284 

1285 AAGAATGGAA TAATTTATCA GAAAGCACTT CTTAAGAAAA TACTTAGCAG TTTCCAAAGA 4133 
1286 

1287 AAATATAAAA TTACTCTTCT GAAAGGAATA CTTATTTTTG TCTTCTTATT TTTGTTATCT 4193 
1288 

1289 TATGTTTCTG TTTGTAG A TAT TTG CAG GAA ATA TAT AAT TCA AAT AAT CAA 4244 

1290 Tyr Leu Gin Glu lie Tyr Asn Ser Asn Asn Gin 

1291 135 140 145 
1292 

1293 AAG ATT GTT AAC CTG AAA GAG AAG GTA GCC CAG CTT GAA GCA CAG TGC 4292 

1294 Lys lie Val Asn Leu Lys Glu Lys Val Ala Gin Leu Glu Ala Gin Cys 

1295 150 155 160 
1296 

1297 CAG GAA CCT TGC AAA GAC ACG GTG CAA ATC CAT GAT ATC ACT GGG AAA G 4341 

1298 Gin Glu Pro Cys Lys Asp Thr Val Gin lie His Asp lie Thr Gly Lys 

1299 165 170 175 
1300 

13 01 GTAACTGATG AAGGTTATAT TGGGATTAGG TTCATCAAAG TAAGTAATGT AAAGGAGAAA 4401 
1302 

13 03 GTATGTACTG GAAAGTATAG GAATAGTTTA GAAAGTGGCT ACCCATTAAG TCTAAGAATT 4461 
1304 

1305 TCAGTTGTCT AGACCTTTCT TGAATAGCTA AAAAAAACAG TTTAAAAGGA ATGCTGATGT 4521 
1306 

13 07 GAAAAGTAAG AAAATTATTC TTGGAAAATG AATAGTTTAC TACATGTTAA AAGCTATTTT 4581 
1308 

1309 TCAAGGCTGG CACAGTCTTA CCTGCATTTC AAACCACAGT AAAAGTCGAT TCTCCTTCTC 4641 
1310 

1311 TAG AT TGT CAA GAC ATT GCC AAT AAG GGA GCT AAA CAG AGC GGG CTT 4688 

1312 Asp Cys Gin Asp lie Ala Asn Lys Gly Ala Lys Gin Ser Gly Leu 

1313 180 185 190 
1314 

1315 TAC TTT ATT AAA CCT CTG AAA GCT AAC CAG CAA TTC TTA GTC TAC TGT 4736 

1316 Tyr Phe lie Lys Pro Leu Lys Ala Asn Gin Gin Phe Leu Val Tyr Cys 

1317 195 200 205 
1318 

1319 GAA ATC GAT GGG TCT GGA AAT GGA TGG ACT GTG TTT CAG AAG 4778 

1320 Glu lie Asp Gly Ser Gly Asn Gly Trp Thr Val Phe Gin Lys 

1321 210 215 220 
1322 

1323 GTAATTTTTT CCCCACCATG TGTATTTAAT AAATTCCTAC ATTGTTTCTG CCATATGGCA 4838 
1324 

1325 GATACTTTTC TAAGCACCTT GTGAACCGTA GCTCATTTAA TCCTTGCAAT AGCCCTAAGA 48 98 
1326 
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132 7 GGAAGGTACT TCTGTTACTC CTATTTACAG AAAAGGAAAC TGAGGCACAC AAGGTTAAAT 4958 
1328 

132 9 AACTTGCCCA AGACCACATA ACTAATAAGC AACAGAGTCA GCATTTGAAC CTAGGCAGTA 5018 
1330 

1331 TAGTTTCAGA GTTTGTGACT TGACTCTATA TTGTACTGGC ACTGACTTTG TAGATTCATG 5078 
1332 

1333 GTGGCACATA ATCATAGTAC CACAGTGACA AATAAAAAGA AGGAAACTCT TTTGTCAGGT 5138 
1334 

1335 AGGTCAAGAC CTGAGGTTTC CCATCACAAG ATGAGGAAGC CCAACACCAC CCCCCACCAC 5198 
1336 

1337 CCCACCACCA TCACCACCCT TTCACACACC AGAGGATACA CTTGGGCTGC TCCAAGACAA 5258 
1338 

1339 GGAACCTGTG TTGCATCTGC CACTTGCTGA TACCCACTAG GAATCTTGGC TCCTTTACTT 5318 
1340 

1341 TCTGTTTACC TCCCACCACT GTTATAACTG TTTCTACAGG GGGCGCTCAG AGGGAATGAA 5378 
1342 

1343 TGGTGGAAGC ATTAGTTGCC AGACACCGAT TGAGCAATGG GTTCCATCAT AAGTGTAAGA 5438 
1344 

1345 ATCAGTAATA TCCAGCTAGA GTTCTGAAGT CGTCTAGGTG TCTTTTTAAT ATTACCACTC 5498 
1346 

1347 ATTTAGAATT TATGATGTGC CAGAAACCCT CTTAAGTATT TCTCTTATAT TCTCTCTCAT 5558 
1348 

1349 GATCCTTGCA GCAACCCTAA GAAGTAACCA TCATTTTTCC TATTTGATAC ATGAGGAAAC 5618 
1350 

1351 TGAGGTAGCT TGGCCAAGAT CACTTAGTTG GGAGTTGATA GAACCAGTGC TCTGTATTTT 5678 
1352 

1353 TGACAAAATG TTGACAGCAT TCTCTTTACA TGCATTGATA GTCTATTTTC TCCTTTTGCT 5738 
1354 

1355 CTTGCAAATG TGTAATTAG AGA CTT GAT GGC AGT GTA GAT TTC AAG AAA AAC 5790 

1356 Arg Leu Asp Gly Ser Val Asp Phe Lys Lys Asn 

1357 225 230 
1358 

1359 TGG ATT CAA TAT AAA GAA GGA TTT GGA CAT CTG TCT CCT ACT GGC ACA 5838 

1360 Trp lie Gin Tyr Lys Glu Gly Phe Gly His Leu Ser Pro Thr Gly Thr 

1361 235 240 245 
1362 

1363 ACA GAA TTT TGG CTG GGA AAT GAG AAG ATT CAT TTG ATA AGC ACA CAG 5886 

1364 Thr Glu Phe Trp Leu Gly Asn Glu Lys lie His Leu lie Ser Thr Gin 

1365 250 255 260 265 
1366 

1367 TCT GCC ATC CCA TAT GCA TTA AGA GTG GAA CTG GAA GAC TGG AAT GGC 5934 

1368 Ser Ala lie Pro Tyr Ala Leu Arg Val Glu Leu Glu Asp Trp Asn Gly 

1369 270 275 280 
1370 

13 71 AGA ACC AG GTACTGTTTT GAAATGACTT CCAACTTTTT ATTGTAAAGA 5982 

13 72 Arg Thr Ser 

1373 

1374 

1375 TTGCCTGGAA TGTGCACTTT CCAACTATCA ATAGACAATG GCAAATGCAG CCTGACAAAT 6042 
1376 

1377 GCAAACAGCA CATCCAGCCA CCATTTTCTC CAGGAGTCTG TTTGGTTCTT GGGCAATCCA 6102 
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1378 
1379 
1380 
1381 
1382 
1383 
1384 
1385 
1386 
1387 
1388 
1389 
1390 
1391 
1392 
1393 
1394 
1395 
1396 
1397 
1398 
1399 
1400 
1401 
1402 
1403 
1404 
1405 
1406 
1407 
1408 
1409 
1410 
1411 
1412 
1413 
1414 
1415 
1416 
1417 
1418 
1419 
1420 
1421 
1422 
1423 
1424 
1425 
1426 
1427 
1428 



AAAAGGTAAA TTCTATTCAG GATGAATCTA AGTGTATTGG TACAATCTAA TTACCCTGGA 6162 

ACCATTCAGA GTAATAGCTA ATTACTGAAC TTTTAATCAG TCCCAGGAAT TGAGCATAAA 6222 

ATTATAATTT TATCTAGTCT AAATTACTAT TTCATGAAGC AGGTATTATT ATTAATCCCA 6282 

TTTTATAGAT TAACTTGCTC AAAGTCACAT TGCTGATAAG TGGTAGAGGT AGAATTCAGA 6342 

CTCAAGTAGT TTAACTTTAG AGCCTGTCCT CTTAACAACT ATCCTGGTTG AAAAGCAAAT 6402 

ACAGCCTCTT CAGACTTCTC AGTGCCTTGA TGGCCATTTA TTCTGTCAAA TCATGAGCTA 6462 

CCCTAAAAGT AAACCAGCTA GCTCTTTTGA TGATCTAGAG GCTTCTTTTT GCTTGAGATA 6522 

TTTGAAGGTT TTAAGCATTG TTACCTAATT AAAATGCAGA AAAATATCCA ACCCTCTTGT 6582 

TATGTTTAAG GAATAGTGAA ATATATTGTC TTCAAACACA TGGACTTTTT TTTATTGCTT 6642 

GGTTGGTTTT TAATCCAGAA AGTGCTATAG TCAGTAGACC TTCTTCTAGG AAAGGACCTT 6702 

CCATTTCCCA GCCACTGGAG ATTAGAAAAT AAGCTAAATA TTTTCTGGAA ATTTCTGTTC 6762 

ATTCATTAAG GCCCATCCTT TCCCCCACTC TATAGAAGTG TTGTCCACTT GCACAATTTT 6822 

TTCCAGGAAA GAATCTCTCT AACTCCTTCA GCTCACATGC TTTGGACCAC ACAGGGAAGA 6882 

CTTTGATTGT GTAATGCCCT CAGAAGCTCT CCTTCTTGCC ACTACCACAC TGATTTGAGG 6942 

AAGAAAATCC CTTTAGCACC TAACCCTTCA GGTGCTATGA GTGGCTAATG GAACTGTACC 7002 

TCCTTCAAGT TTTGTGCAAT AATTAAGGGT CACTCACTGT CAGATACTTT CTGTGATCTA 7062 

TGATAATGTG TGTGCAACAC ATAACATTTC AATAAAAGTA GAAAATATGA AATTAGAGTC 7122 

ATCTACACAT CTGGATTTGA TCTTAGAATG AAACAAGCAA AAAAGCATCC AAGTGAGTGC 7182 

AATTATTAGT TTTCAGAGAT GCTTCAAAGG CTTCTAGGCC CATCCCGGGA AGTGTTAATG 7242 

AGCTGTGGAC TGGTTCACAT ATCTATTGCC TCTTGCCAGA TTTGCAAAAA ACTTCACTCA 73 02 

ATGAGCAAAT TTCAGCCTTA AGAAACAAAG TCAAAAATTC CAAGGAAGCA TCCTACGAAA 7362 

GAGGGAACTT CTGAGATCCC TGAGGAGGGT CAGCATGTGA TGGTTGTATT TCCTTCTTCT 7422 

CAG T ACT GCA GAC TAT GCC ATG TTC AAG GTG GGA CCT GAA GCT GAC 7468 
Thr Ala Asp Tyr Ala Met Phe Lys Val Gly Pro Glu Ala Asp 
285 290 295 

AAG TAC CGC CTA ACA TAT GCC TAC TTC GCT GGT GGG GAT GCT GGA GAT 7516 
Lys Tyr Arg Leu Thr Tyr Ala Tyr Phe Ala Gly Gly Asp Ala Gly Asp 
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330 
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1435 


ACA 


TCC 


CAT AAT 


GGC 


ATG 


CAG 


TTC AGT 


ACC 


TGG 


GAC 


AAT 


GAC AAT GAT 


7612 


1436 


Thr 


Ser 


His 


i Asn 


Gly 


Met 


Gin 


Phe Ser 


Thr 


Trp 


Asp 


Asn 


Asp Asn Asp 




1437 










335 












340 










345 




1438 




































1439 


AAG 


TTT 


GAA GGC 


AAC 


TGT 


GCT 


GAA CAG 


GAT 


GGA 


TCT 


GGT 


TGG TGG ATG 


7660 


1440 


Lys 


Phe 


Glu Gly 


Asn 


Cys 


Ala 


Glu Gin 


Asp 


Gly 


Ser 


Gly 


Trp Trp Met 




1441 








350 










355 










360 




1442 




































1443 


AAC 


AAG 


TGT CAC 


GCT 


GGC 


CAT 


CTC AAT 


GGA 


GTT 


TAT 


TAC 


CAA G 


7703 


1444 


Asn 


Lys 


Cys 


i His 


Ala 


Gly 


His 


Leu Asn 


Gly 


Val 


Tyr 


Tyr 


Gin 




1445 






365 










370 










375 








1446 




































1447 


GTATGTTTTC 


CTTTCTTAGA 


TTCCAAGTTA 


ATGTATAGTG 


TATACTATTT 


TCATAAAAAA 


7763 


1448 




































1449 


TAATAAATAG 


ATATGAAGAA 


ATGAAGAATA 


ATTTATAAAG 


ATAGTAGGGA 


TTTTATCATG 


7823 


1450 . 




































1451 


TTCTTTATTT 


CAACTAAGTT 


CTTTGAAACT 


GGAAGTGGAT 


AATACCAAGT 


TCATGCCTAA 


7883 


1452 




































1453 


AATTAGCCCT 


TCTAAAGAAA 


TCCACCTGCT 


GCAAAATATC 


CAGTAGTTTG 


GCATTATATG 


7943 


1454 




































1455 


TGAAACTATC 


ACCATCATAG 


CTGGCACTGT 


GGGTTGTGGG 


ATCTCCTTTA 


GACATACAAC 


8003 


1456 




































1457 


ATAAATGATC 


TGGATGGATT 


AACATTACTA 


CATGGATGCT 


TGTTGACACA 


TTAACCTGGC 


8063 


1458 




































1459 


TTCCCATGAG 


CTTTGTGTCA 


GATACACGCA 


GTGAACAGGT 


GTTTGGAGGA 


ACAGAATAAA 


8123 


1460 




































1461 


GAGAAGGCAA 


GCACTGGTAA 


GGGCAGGGGT 


TTGTGAAAGC 


TTGAGAGAAG 


AGACCAGTCT 


8183 


1462 




































1463 


GAGGACAGTA 


GACACTTATT 


TTAGGATGGG 


GGTTGGATGA 


GGAGGCTATA 


GTTTGCTATA 


8243 


1464 




































1465 


AGCTTGGAAT 


GGTTTGGAAC 


ACTGGTTTCA 


CTCACCTACC 


CAGCAGTTAT 


GTGTGGGGAA 


8303 


1466 




































1467 


GCCTTACCGA 


TGCTAAAGGA 


TCCATGTTAC 


AATAATGGCA 


TTATTTGGAA 


ATCCCAGTGG 


8363 


1468 




































1469 


TATTCCATGA 


ATAAAACCAC 


TATGAAGATA 


ATCCCACTCA 


ACAGACTCTC 


CGTTGGAGAA 


8423 


1470 




































1471 


GGACAGCAAC 


ACCACCCTGG 


GAAAGCCAAA 


CAGTCAGACC 


AGACCTGTTT 


AGCATCAGTA 


8483 


1472 




































1473 


GGACTTCCCT 


ACCATATCTG 


CTGGGTAGAT 


GAGTGAAACC 


AGTGTTCCAA 


ACCACTCCGG 


8543 


1474 




































1475 


GCTTGTAGCA 


AACCATAGTC 


TCCTCATCTA 


CCAAGATGAG 


CAACCTTACC 


TCCTGATGTC 


8603 


1476 




































1477 


CTAGCCAATC 


ACCAACTAGG 


AAACTTTGCA 


CAGTTTATTT 


AAAGTAACAG 


TTTGATTTTC 


8663 


1478 




































1479 


ACAATATTTT 


TAAATTGGAG 


AAACATAACT 


TATCTTTGCA 


CTCACAAACC 


ACATAATGAG 


8723 



ft 
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1480 

1481 AAGAAACTCT AAGGGAAAAT GCTTGATCTG TGTGACCCGG GGCGCCATGC CAGAGCTGTA 8783 
1482 

1483 GTTCATGCCA GTGTTGTGCT CTGACAAGCC TTTTACAGAA TTACATGAGA TCTGCTTCCC 8843 
1484 

1485 TAGGACAAGG AGAAGGCAAA TCAACAGAGG CTGCACTTTA AAATGGAGAC ATAAAATAAC 8903 
1486 

1487 ATGCCAGAAC CATTTCCTAA AGCTCCTCAA TCAACCAACA AAATTGTGCT TTCAAATAAC 8963 
1488 

1489 CTGAGTTGAC CTCATCAGGA ATTTTGTGGC TCCTTCTCTT CTAACCTGCC TGAAGAAAGA 9023 
1490 

1491 TGGTCCACAG CAGCTGAGTC CGGGATGGAT AAGCTTAGGG ACAGAGGCCA ATTAGGGAAC 9083 
1492 

1493 TTTGGGTTTC TAGCCCTACT AGTAGTGAAT AAATTTAAAG TGTGGATGTG ACTATGAGTC 9143 
1494 

1495 ACAGCACAGA TGTTGTTTAA TAATATGTTT ATTTTATAAA TTGATATTTT AGGAATCTTT 9203 
1496 

1497 GGAGATATTT TCAGTTAGCA GATAATACTA TAAATTTTAT GTAACTGGCA ATGCACTTCG 9263 
1498 

1499 TAATAGACAG CTCTTCATAG ACTTGCAGAG GTAAAAAGAT TCCAGAATAA TGATATGTAC 9323 
1500 

1501 ATCTACGACT TGTTTTAG 
1502 
1503 
1504 

1505 GGT TAT GAT AAT GGC 

1506 Gly Tyr Asp Asn Gly 

1507 390 
1508 

1509 TCC ATG AAG AAA ACC 

1510 Ser Met Lys Lys Thr 

1511 405 
1512 

1513 ATT GGA GAA GGA CAG 

1514 He Gly Glu Gly Gin 

1515 420 
1516 

1517 CCA GAG CAC CCT GCG 

1518 Pro Glu His Pro Ala 

1519 440 
1520 

1521 GAT TTG TAGAAAATTA ACTGCTAACT TCTATTGACC CACAAAGTTT CAGAAATTCT 9621 

1522 Asp Leu 
1523 

1524 

1525 CTGAAAGTTT CTTCCTTTTT TCTCTTACTA TATTTATTGA TTTCAAGTCT TCTATTAAGG 9681 
1526 

1527 ACATTTAGCC TTCAATGGAA ATTAAAACTC ATTTAGGACT GTATTTCCAA ATTACTGATA 9741 
1528 

1529 TCAGAGTTAT TTAAAAATTG TTTATTTGAG GAGATAACAT TTCAACTTTG TTCCTAAATA 9801 
1530 



GT GGC ACT TAC TCA AAA GCA TCT ACT CCT AAT 9373 
Gly Gly Thr Tyr Ser Lys Ala Ser Thr Pro Asn 
380 385 

ATT ATT TGG GCC ACT TGG AAA ACC CGG TGG TAT 9421 
He He Trp Ala Thr Trp Lys Thr Arg Trp Tyr 
395 400 

ACT ATG AAG ATA ATC CCA TTC AAC AGA CTC ACA 9469 
Thr Met Lys He He Pro Phe Asn Arg Leu Thr 
410 415 

CAA CAC CAC CTG GGG GGA GCC AAA CAG GTC AGA 9517 
Gin His His Leu Gly Gly Ala Lys Gin Val Arg 
425 430 435 

GAA ACA GAA TAT GAC TCA CTT TAC CCT GAG GAT 9565 
Glu Thr Glu Tyr Asp Ser Leu Tyr Pro Glu Asp 
445 450 
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1531 
1532 
1533 
1534 
1535 
1536 
1537 
1538 
1539 
1540 
1541 
1542 
1543 
1544 
1545 
1546 
1547 
1548 
1549 
1550 
1551 
1552 
1553 
1554 
1555 
1556 
1557 
1558 
1559 
1560 
1561 
1562 
1563 
1564 
1565 
1566 
1567 
1568 
1569 
1570 
1571 
1572 
1573 
1574 
1575 
1576 
1577 
1578 
1579 
1580 
1581 



TATAATAATA 
CTTCGTAAAT 
CATTTTAGAG 
TTTAATTCTT 
ATATATTCAC 
GGACTTTATC 
TTCACTTCAT 
ATGATAATTT 
TACCTTTATT 
AAC CATCAGT 
GTTACAAAGG 
AAAATACAGC 
CACAGCCACA 



AAATGATTGA 
TATTTTCATT 
TTTCAAATTC 
CTTTTTAAAT 
AGGCTGGAGA 
TGAACAGAGA 
TTTAAGAGCA 
ATCTACATGC 
GACCATTAAA 
AGTAATTGAG 
AAATAGCAAT 
AACCCAAACT 
CAACCAAGAT 



CTTTATTTGC 
ATATCAAATA 
CCAGGTATTT 
GGAGAAAATT 
CGTTTAAAAG 
GATATAATAT 
AAAGACCCCA 
ATTTCAATAA 
AAACCACCAC 
TCCTCATTTT 
TATGGCTTTT 
GACAATACTC 
GCATAGTATC 



ATTTTTATGA 
TTTTAGTATG 
TCCTTGTTTA 
ATGTCTTTTT 
ACCGTTTCAA 
TTTTCCTATT 
TGTTGAAAAC 
ACCTTTTGTT 
TTTTTGCCAA 
ATGCTAAATG 
GCCCTCTAGG 
TATACAAGAA 
CAAAGTGCAG 



CCACTTGTCA 
TACTTAATAA 
TTACCCCTAA 
AATATGGTTT 
AAGAGATTTA 
GGACAATGGA 
TCCATAACAG 
TCCTAAGACT 
TTTACCAATT 
TTATGCCTAA 
AGATACAGGA 
CATAATCACT 
CTG 



TTTATTTTGT 
AATAGGAGAA 
ATCATTCCTA 
TTGTTTTGTT 
CTTTTTTAAA 
CTTGCAAAGC 
TTTTATGCTG 
AGATACATGG 
ACAATTGGGC 
CTCTTTGGGA 
CAAATACAGG 
AAGCAGGAGT 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Met Ser Trp Ser Leu His Pro Arg Asn Leu lie Leu Tyr Phe Tyr Ala 
15 10 15 

Leu Leu Phe Leu Ser Ser Thr Cys Val Ala Tyr Val Ala Thr Arg Asp 
20 25 30 

Asn Cys Cys lie Leu Asp Glu Arg Phe Gly Ser Tyr Cys Pro Thr Thr 
35 40 45 

Cys Gly lie Ala Asp Phe Leu Ser Thr Tyr Gin Thr Lys Val Asp Lys 
50 55 60 

Asp Leu Gin Ser Leu Glu Asp lie Leu His Gin Val Glu Asn Lys Thr 



9861 
9921 
9981 
10041 
10101 
10161 
10221 
10281 
10341 
10401 
10461 
10521 
10564 



PAGE: 32 RAW SEQUENCE LISTING DATE: 03/23/94 

PATENT APPLICATION US/08/206,176 TIME: 12:16:32 

INPUT SET: S1765.raw 



1582 65 70 75 80 

1583 

1584 Ser Glu Val Lys Gin Leu lie Lys Ala lie Gin Leu Thr Tyr Asn Pro 

1585 85 90 95 
1586 

1587 Asp Glu Ser Ser Lys Pro Asn Met He Asp Ala Ala Thr Leu Lys Ser 

1588 100 105 110 
1589 

1590 Arg He Met Leu Glu Glu He Met Lys Tyr Glu Ala Ser He Leu Thr 

1591 115 120 125 
1592 

1593 His Asp Ser Ser He Arg Tyr Leu Gin Glu He Tyr Asn Ser Asn Asn 

1594 130 135 140 
1595 

1596 Gin Lys He Val Asn Leu Lys Glu Lys Val Ala Gin Leu Glu Ala Gin 

1597 145 150 155 160 
1598 

1599 Cys Gin Glu Pro Cys Lys Asp Thr Val Gin He His Asp He Thr Gly 

1600 165 170 175 
1601 

1602 Lys Asp Cys Gin Asp He Ala Asn Lys Gly Ala Lys Gin Ser Gly Leu 

1603 180 185 190 
1604 

1605 Tyr Phe He Lys Pro Leu Lys Ala Asn Gin Gin Phe Leu Val Tyr Cys 

1606 195 200 205 
1607 

1608 Glu He Asp Gly Ser Gly Asn Gly Trp Thr Val Phe Gin Lys Arg Leu 

1609 210 215 220 
1610 

1611 Asp Gly Ser Val Asp Phe Lys Lys Asn Trp He Gin Tyr Lys Glu Gly 

1612 225 230 235 240 
1613 

1614 Phe Gly His Leu Ser Pro Thr Gly Thr Thr Glu Phe Trp Leu Gly Asn 

1615 245 250 255 
1616 

1617 Glu Lys He His Leu He Ser Thr Gin Ser Ala He Pro Tyr Ala Leu 

1618 260 265 270 
1619 

1620 Arg Val Glu Leu Glu Asp Trp Asn Gly Arg Thr Ser Thr Ala Asp Tyr 

1621 275 280 285 
1622 

1623 Ala Met Phe Lys Val Gly Pro Glu Ala Asp Lys Tyr Arg Leu Thr Tyr 

1624 290 295 300 
1625 

1626 Ala Tyr Phe Ala Gly Gly Asp Ala Gly Asp Ala Phe Asp Gly Phe Asp 

1627 305 310 315 320 
1628 

1629 Phe Gly Asp Asp Pro Ser Asp Lys Phe Phe Thr Ser His Asn Gly Met 

1630 325 330 335 
1631 

1632 Gin Phe Ser Thr Trp Asp Asn Asp Asn Asp Lys Phe Glu Gly Asn Cys 



PAGE: 33 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/206,176 



DATE: 03/23/94 
TIME: 12:16:39 



INPUT SET: S1765.raw 



1633 
1634 
1635 
1636 
1637 
1638 
1639 
1640 
1641 
1642 
1643 
1644 
1645 
1646 
1647 
1648 
1649 
1650 
1651 
1652 
1653 
1654 
1655 
1656 
1657 
1658 
1659 
1660 
1661 
1662 
1663 
1664 
1665 
1666 
1667 
1668 
1669 
1670 
1671 
1672 
1673 
1674 
1675 
1676 
1677 
1678 
1679 
1680 
1681 
1682 
1683 



340 



345 



350 

Cys His Ala Gly 
365 

Ser Lys Ala Ser 



Thr Trp Lys Thr 
400 

lie Pro Phe Asn 
415 

Gly Gly Ala Lys 
430 

Asp Ser Leu Tyr 
445 



Ala Glu Gin Asp Gly Ser Gly Trp Trp Met Asn Lys 
355 360 

His Leu Asn Gly Val Tyr Tyr Gin Gly Gly Thr Tyr 
370 375 380 

Thr Pro Asn Gly Tyr Asp Asn Gly lie lie Trp Ala 
385 390 395 

Arg Trp Tyr Ser Met Lys Lys Thr Thr Met Lys lie 
405 410 

Arg Leu Thr lie Gly Glu Gly Gin Gin His His Leu 
420 425 

Gin Val Arg Pro Glu His Pro Ala Glu Thr Glu Tyr 
435 440 

Pro Glu Asp Asp Leu 
450 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: ovine beta-lactoglobulin 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ACGCGTGTCG ACCTGCAGGT CAACGGATCT CTGTGTCTGT TTTCATGTTA GTACCACACT 60 

GTTTTGGTGG CTGTAGCTTT CAGCTACAGT CTGAAGTCAT AAAGCCTGGT ACCTCCAGCT 120 

CTGTTCTCTC TCAAGATTGT GTTCTGCTGT TTGGGTCTTT AGTGTCTCCA CACAATTTTT 180 

AGAATTGTTT GTTCTAGTTC TGTGAAAAAT GATGCTGGTA TTTTGATAAG GATTGCATTG 240 

AATCTGTAAA GCTACAGATA TAGTCATTGG GTAGTACAGT CACTTTAACA ATATTAACTC 3 00 

TTCACATCTG TGAGCATGAT ATATTTTCCC CCTCTATATC ATCTTCAATT CCTCCTATCA 360 

GTTTCTTTCA TTGCAGTTTT CTGAGTACAG GTCTTACACC TCCTTGGTTA GAGTCATTCC 420 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/206,176 



DATE: 03/23/94 
TIME: 12:16:46 



INPUT SET: S176S.raw 



1684 
1685 
1686 
1687 
1688 
1689 
1690 
1691 
1692 
1693 
1694 
1695 
1696 
1697 
1698 
1699 
1700 
1701 
1702 
1703 
1704 
1705 
1706 
1707 
1708 
1709 
1710 
1711 
1712 
1713 
1714 
1715 
1716 
1717 
1718 
1719 
1720 
1721 
1722 
1723 
1724 
1725 
1726 
1727 
1728 
1729 
1730 
1731 
1732 
1733 
1734 



TCAGTATTTT 
GATAGCTCAT 
TCCTGCAACA 
TAGCTTTTTG 
GGTGTCATGT 
TTGATTTCTT 
AAGAGTGGAC 
CATTAATTAT 
TTCCCTCTAT 
AAGTTTTTCC 
CTTCATTTTG 
ATTTGTGGAT 
TTTTTGAATT 
TATTGGCCTA 
AGAGTTTAGA 
TAACTCTTCT 
AGGGATGTGG 
AACAAGCCCC 
CAGAGAAAGC 
AATACAGCTG 
TTGATTCAAT 
GTCTTGGGAG 
TGGACATGCA 
AGGCTGGGGC 
AGCCCCACAA 



ATTCCTTTGA 
TGTTAGTGTA 
GATTTCTATG 
GTGACATCTT 
CATCTGCAAA 
TCTGTCTGAG 
ATCCTTGTCT 
AATGTTTACT 
ACCCACCTTG 
TGCATCTATT 
TTAATGATTT 
ACCTTTGTAT 
CACTTTGCTA 
AGAAAGGTTT 
AGCATTTCCT 
TTAAATGTTT 
GTTTGATCCC 
CAAGCTGCAA 
CCACATACAG 
TGAAGCCGTC 
TTCATTACTG 
ATTGTACATG 
TGGGAGCACA 
CCCAAGCAGC 
TTTCATTCTG 



TACAATTGTG 
TATATAGAAA 
TATTAATTTT 
GAGGATTTTC 
CAGTGGCAGT 
TACGACTAGG 
TATTTTTCTG 
GTGGGCTTGT 
TTGAGAGTTT 
GAGATGATTT 
CCATTCTTCA 
CCCTGGGATA 
ATATTCTGTT 
TGTCTGGTTT 
CCTCTTTGAT 
GGGGACTTCC 
TGGTCAGGGA 
CCACTGAGCT 
CAGGGAAGAC 
TGGTCCTGGA 
GTAACTGGTC 
CCTAGGAATG 
CAGCACCGAC 
ATGGCATCCT 
AGAAGTGATT 



AATGAGGTAA 
AGCAACAGAT 
GTATCCTGCT 
TGAAGAAAAT 
TTTCCTTCTT 
ATTCCCAATA 
ACCTTAGAGG 
CATATGTGGC 
TTATCATAAA 
TTACTCTTCA 
ATTTGTTAAC 
AACCTCACTT 
GGGTATTTTT 
TAGTATCAGG 
TTTTCGGAAT 
CTGGTGAGCC 
ACCATTAATA 
GCAACCGCTG 
CCAGCACAAC 
CTCCTGCTTG 
TGTTCATATT 
TGTCCGTTTC 
CAGCGAGACT 
AGAGTGTGTG 
CCTTGCTTCT 



TTTTCTTAGT 
TTCTATGTAT 
ACTTTACGGA 
GGCATGGTAT 
CCCTTCCAAC 
CTATACCGAA 
AAATGCTTTC 
CTTCATTATA 
AGTATGTTGA 
ATTCATTAAT 
GTGGTATATC 
GATCATGAGC 
GCATCTCTAT 
GTGATGCTGG 
AGTTTGAGTA 
GGTGGTTGAG 
AGATCCCACA 
CAGTGCCCAC 
CGGAAAAAGG 
AGGGAATTTT 
TTCTATTTCT 
TTCTAGGTTG 
CATGCTGGCT 
AAAGCCCACT 
GCACTTACAG 



TTCTCTTTCT 
TAATTTTGTA 
ATTCACTTAT 
GGTAGGACAA 
CTGGATTTCT 
TAAAAGTGGC 
AGTTTTTCAC 
TGGAGGTCTA 
ATTTTGTCAA 
GATTTTTATT 
ACATTGATTG 
TTTCAATGTA 
TCATCAATGA 
CCTCATAGAG 
GGATAGGTAT 
AATCCGCCTC 
TGCTGCAGGC 
AGGCCACGAC 
AGTTTGGTGG 
TTAAAAATTA 
TCCGGGTTCA 
TCCATTTTAT 
TCCTGGGGCC 
GACCCTGCCC 
GCCCAGGATC 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
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PATENT APPLICATION US/08/206,176 



DATE: 03/23/94 
TIME: 12:16:53 



INPUT SET: S1765.raw 



1735 
1736 
1737 
1738 
1739 
1740 
1741 
1742 
1743 
1744 
1745 
1746 
1747 
1748 
1749 
1750 
1751 
1752 
1753 
1754 
1755 
1756 
1757 
1758 
1759 
1760 
1761 
1762 
1763 
1764 
1765 
1766 
1767 
1768 
1769 
1770 
1771 
1772 
1773 
1774 
1775 
1776 
1777 
1778 
1779 
1780 
1781 
1782 
1783 
1784 
1785 



TGACCTGCTT 
GGTCCAGGTC 
TCTCCCTCCT 
CTCTCTCTGG 
CTGAAGGGCA 
GTCCCGAGAT 
TCCCTGGCTG 
GTCACAGCCT 
TACCTATGGG 
TCCCAGTCCG 
CCCCCCCGCT 
CTCAGTCTAG 
CGTTTTCTCT 
AGCTCACATC 
TGTGAGTTCT 
GGAACCTGGG 
TTGCAAGAAT 
TCTTTCCATC 
CTCTTTCCTG 
AGCCGGAGCT 
CCCCCTGGAA 
ATCAGGAGAC 
ATACGCCCAC 
TTTGCATCCC 
GTGGATTCAG 
CAACACACCC 



CTGAGGAGCA 
CCCTCCCAGG 
ACAGTGGTCA 
ATGGTATTCT 
TGGTTTGTGG 
TGGTGGCACC 
ACCTCCCTCC 
CTCGCATCTA 
GAAGGGCTTC 
GACGTCAAAC 
GCTGCCTCCA 
TCTCTGCCTC 
CACAAGGACA 
TGTAAAGACC 
GGGGCGACAT 
GTTTATCGTA 
TAAAGGTGCT 
ACAGGAAAGG 
CTACAGTCAC 
AGTTTAGGAG 
AGACCCTACA 
ATCCCCGCTA 
CCCACCCCTG 
CTTTGCCAGA 
CCAGGAGGCC 
AGCAC CAGCA 



GGGGTTTTGG 
CCCCCCTGTC 
GTCCCAGCTG 
CTGGAAGCTG 
TCACGGTTCA 
CAGATTTCCT 
TCCCTGCATC 
ACGCCGGTGT 
TCACTGCAGT 
AGGCCGAGCT 
GCTCCTGGTG 
CGTGTTCACA 
CCGGACATTA 
TAGATTCCAA 
CCTTCAACCC 
AAACCCAGGT 
AATACAGATC 
GGGGCCTGGG 
CAACAGTCTC 
ACCCCTGAAC 
GTTCAGGGGG 
TCAGGAGATT 
TGATGAGCAG 
GGCACAAGGC 
TGTCCTGCAC 
TTCCCGCTGC 



CAGGACGGGG 
TGGGGCAGCC 
CCCCAGGCCA 
AAGGTTCCTG 
CAGGAACTTG 
AAGCTCGCTG 
ACCCAGTTCT 
CCAAACCACC 
GGTGCCCCCC 
CCCTCCAGAG 
CCGCACCCTT 
CGCCTTCTCC 
GATTAGCCCC 
ACAAGATTCC 
CATCACAGCT 
TCTTCATGAA 
AGGGCAAGGA 
GGCGGCTGGA 
TCTGGGAAGG 
CTCCACCCAA 
GAAGAGGGGC 
CCCCCACCTT 
TTTAGTCACT 
ACCCACAGCC 
CCTCCCTGCT 
TCCTGAGGTC 



AGATGCTGAG 
CTTGGGAAAG 
GAGCTGCTTT 
AAGTTATGAA 
GGAGACCCTG 
GGGAACAGGG 
GAAAGCAGAG 
CGTGCTGGTG 
GTCCCCTCTG 
GCTCCAGGGA 
GAGCCTGATC 
CCATGTCCCC 
TGTTCCAGCC 
AACCTGAAGT 
TGCAGTTCAT 
ACACTGAGCT 
CTGAAGCTGG 
GGTCTGCTCC 
AAACCAGAGG 
GATGCTGACC 
TGACCCGCCA 
GCTCCCGTTC 
TAGAATGTCA 
TGCTGGGTAC 
CGGGCCCCCT 
TGCAGGCAGC 



AGCCGACGGG 
ATTGCCCCAG 
ATTTCCGTCT 
TAGCTTTGCC 
CAGCTCAGAC 
CGCTTGTTTC 
CGGTGCTGGG 
TTCGGGGGGC 
AGATCAGAAG 
GGGATCCTTG 
TTGTAGACGC 
TCCGTGTCCC 
TCACCTGAAC 
TCCCGGTGGA 
CGCAAAACAT 
TCGAGGCTTG 
CTAAGCCTCC 
CGTGAGTGAG 
CCAGAGAGCA 
AGCCAGCGGG 
GGTCCCTGCT 
CCCTATCCCA 
ACTGAAGGCT 
CGACGCCCAT 
CTGTGCTCAG 
TCGCTGTAGC 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/206,176 



DATE: 03/23/94 
TIME: 12:17:00 



INPUT SET; S1765.raw 



1786 
1787 
1788 
1789 
1790 
1791 
1792 
1793 
1794 
1795 
1796 
1797 
1798 
1799 
1800 
1801 
1802 
1803 
1804 
1805 
1806 
1807 
1808 
1809 
1810 
1811 
1812 
1813 
1814 
1815 
1816 
1817 
1818 
1819 
1820 
1821 
1822 
1823 
1824 
1825 
1826 
1827 
1828 
1829 
1830 
1831 
1832 
1833 
1834 
1835 
1836 



CTGAGCGGTG 
GGTTGGGCCC 
CGTGCCCCCG 
ATTGCCATTG 
CCGACCAGGT 
AGAGGGTGGG 
GCTGGTGCCC 
CCTGTCCTTG 
CCCAGAGTCC 
CCCAGAGGGG 
GGAAGCCACC 
GGCCCAGCCC 
CAGCCCTCCA 
AAGTGCCTCC 
ACCCAGACCA 
TGCAGGGCGG 
CCATCAGTCA 
ACTGTCTTGC 
CCTGCGGCCG 
TTCCTCTTCC 
CGACCTTTGT 
TTAGTGGGAC 
CCAGCCATGT 
TCTTTGGGTT 
ATCTCAGGGC 



TGGAGGGAAG 
TGTGGGCCTG 
CCGCAGGGGT 
CCATTTTTGC 
CCTCCCTCGG 
TGACTGCAGA 
AAGGCAGAGG 
TCTAAGAGGC 
AGACACCCAC 
ACTTCCTGCT 
CCGGGGCCTG 
GGGCTGGCTG 
CTCCCTGCAG 
TGCTTGCCCT 
TGAAAGGCCT 
GCAGGGGAGC 
GCTAGGGCCA 
ATTCTGGAGG 
CTCTCTGGGG 
TGTGAGGCCA 
CATCTCTTTA 
ACAGTTCAGC 
CTCCCCAAGA 
CAGTGTGAGT 
TGCCCAGGCC 



TGTCCTGGGA 
CCCATCCCAC 
CAGGTCACTT 
TACCCTAACT 
AGCTCGACCT 
GATCCCTTCA 
CCACCCTCCA 
TGACCCCGGA 
CTGTGCCCCC 
TGGCCTTGGA 
AGGATGAGCC 
GCCTGCATGC 
AGCTCAGAAG 
GGGCCTGGCC 
GGACATCCAG 
TGGGCCTCAG 
CCTGACAAAT 
CTGGAAGCCC 
AGCAGACGGC 
CCAGGCCTGC 
AAGGCCATGT 
CCCTAAAAGA 
TCCAAATGTT 
CTGGGGAGAG 
GGGGTGGGAC 



GATTTAAAAT 
GTGCCTGCAT 
TCCCGTCCTG 
GGGCAGCAGG 
GAACCCCATG 
CCCAAGGCCA 
GGACACACCT 
AGTGTTCCTG 
GCTTCTGGGG 
TGGAAGAAGG 
AAGTGGGATT 
CTCCTGTATA 
CACGACCCCA 
CTCGCCTGTG 
AAGGTTCGAG 
AGAGCCAAGA 
CCCCGCTGGG 
AAGATCCAGG 
CGTCTTCTCC 
TGGAAACACG 
CTCCAGAGTC 
GTCTCTCTGC 
GCTACATGTG 
CATTCCCCAG 
AGAGAGCCCA 



GTGAGAGGCG 
TAGCCCCAGT 
GGGTTATTAT 
TGCTTGCAGA 
TCACCCTTGC 
CGGTCACATG 
GTCCCCAGTG 
GCACTGGCAG 
TCTACCAGGA 
CCTCCTATTG 
CCGGGAACCG 
AGGCCCCAAG 
GGGATATCCC 
GCGTCCAGGC 
GGTTGGCCGG 
GAGGCTGTGA 
GCAGCTTCAA 
TGTTGGCAGG 
AGTCCTCTGC 
CCTGCCTGCG 
ATGTGTTGAA 
CCCTCAAATT 
GGGGGGCTCA 
GGTGCAGAGT 
CTGTGGGGCT 



GGAGGTGGGA 
GCTGCTCAGC 
GACTCTTGTC 
GCCCTCGATA 
CCCAGCCTGC 
GTTTGGAGGA 
CTGGCTCTGA 
CCAGCCTGGA 
ACCGTCTAGG 
TCCTCGTAGA 
CGTGGCTGGG 
CCTGCTGTCT 
TGCAGCCATG 
CATCATCGTC 
GTGGGTGAGT 
CGTTGGGTTC 
CCAGGCGTTC 
GCTGGCTTCT 
GCGCCCTGAT 
CAGCTTCACA 
GTTCTGGGGG 
TTCCCCACCT 
TCTGGGTCCC 
TGGGGGGAGT 
GGGGGCCCCT 



3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/206,176 



DATE: 03/23/94 
TIME: 12:17:06 



INPUT SET: S1765.raw 



1837 
1838 
1839 
1840 
1841 
1842 
1843 
1844 
1845 
1846 
1847 
1848 
1849 
1850 
1851 
1852 
1853 
1854 
1855 
1856 
1857 
1858 
1859 
1860 
1861 
1862 
1863 
1864 
1865 
1866 
1867 
1868 
1869 
1870 
1871 
1872 
1873 
1874 
1875 
1876 
1877 
1878 
1879 
1880 
1881 
1882 
1883 
1884 
1885 
1886 
1887 



TCCCACCCCC 
GGCTATGGCG 
CGTGGAGGAG 
GGCGTCTCTC 
GGTGCAGGAG 
GGCGCTCCAC 
GATGCTTCAG 
GGGGAGATAA 
ATTCTGAATT 
TCATCATTTC 
GCAGTGTGGG 
CTCAAGCAGT 
AGCTGGATGG 
GCCTGTGGTC 
CATCCACCCG 
CTGGCTCCTA 
CCTCAGTTCA 
GGGAGAACGG 
TGTTCAAGAT 
GACTGTGGAC 
ATGAAGACAC 
GGGGCGGTGG 
CTCTCTTTGC 
GTGCCCTCCC 
CCCCAGCCCG 
TTCACTTGTT 



AGAGTGCAAC 
GCCAGCGACA 
CTGAAGCCCA 
CCCAACATGG 
GGACCAGGGC 
CCAAGGCTGC 
AACATCATCA 
AATCCTCTGA 
CCCTGTTAGT 
TAGTCTGAAC 
GATAGGCCCG 
TCCCGCTACC 
GTGTGGGGGC 
AAGGCTCTCC 
GTCCCCTGTG 
TGCCCATGCC 
TCCTGATGAA 
CGAGTGTGCT 
CGATGGTGAG 
AGGTTCAGGG 
TGCCTTGACA 
CCATGGGCAG 
TGGGGGGCGG 
AGGGGTAACC 
CTCCACAGCT 
CGTCCTAAAT 



TCAAGGTCCC 
TCTCCCTGCT 
CCCCCGAGGG 
AACCCCCACT 
CCCAGGGCTG 
CCACCCAGGG 
AACAAATGAA 
AGTGGAAATG 
CTGAGGATTA 
ACCTCAGTAT 
TGTGAAGGCT 
AGCCCTGTCC 
AGAGATGGGG 
CTGACCTTTT 
GCCTGAGGTG 
ACCCCCCTCC 
AATGGTCCAT 
CAGAAGAAGA 
TCCGGGTCCC 
GGCTGGCGTC 
CCTGCTTCAC 
GTCCCGGCTG 
GCGGTGCTCT 
GAGAGCCGTT 
CCTTCATCTC 
CCGAGATGAT 



TCTCCAGGTG 
GGATGCCCAG 
CAACCTGGAG 
CCCCAGGGCT 
GGGAAGAGGG 
CTTTTTTTTT 
CATAAAACAT 
CATAGCAAAG 
CAAGTGTATT 
CTAAAATGAA 
GCTGGGAGGC 
ACCTCAGACG 
ACCTGAACCC 
CTCTCTGGCT 
ACAGTGAGTG 
AGCCCTCCTG 
GCCAATGGCT 
TTATTGCAGA 
TGGGGGACAC 
GGGCCCTGGG 
TTGCCTCCCC 
GCGGGCTAAC 
GGGCCCTCAG 
GCCCACTCCA 
CTGGAGACAA 
AAAGCTTCGA 



GCGGGGACTT 
AGTGCCCCCC 
ATCCTGCTGC 
GTGGACCCCC 
CTCAGAGTTT 
TTTTAAACTT 
TCATTTTTGT 
ATACATACAA 
TGAGCAACAG 
CAAGAAGTCC 
AGCAGACCTG 
GGGGTCAGGG 
CAGGGCTGCC 
TCATCTGACT 
CGCCGAGGCT 
GGCCAGCTTC 
CAGAAAGCAG 
AAAAACCAAG 
CCACCACCCC 
ATGCTAAGGG 
TGCCACCTGC 
CCACCAGGGT 
GCTGAGCTCA 
GGGGCCCAGG 
ACTCTGTCCG 
GGGGGGGTTG 



GGCACTCCTT 
TGAGAGTGTA 
AGAAATGGTG 
CGGGGGGTGG 
ACTGGTACCC 
TTATTAATTT 
TTACTTGGAA 
TGAGGCAGGT 
AGAGACATTT 
TGGAAACGAA 
GGTCTTCGGG 
TGCAGGAGAG 
TTTTGGGGGT 
TCTCCTGGCC 
AGTTGGCCAG 
TGCCCCTGGC 
CTGTCTTTCA 
ATCCCTGCGG 
CGCCCCCGGG 
ACTGGTGGTG 
CCGGGGCCTT 
GACACCCGAG 
GGAGGTACCT 
TGCCCCACGA 
CCCTCGCTCA 
GGGTTCCATC 



5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/206,176 



DATE: 03/23/94 
TIME: 12:17:13 



INPUT SET: S1765.raw 



1888 
1889 
1890 
1891 
1892 
1893 
1894 
1895 
1896 
1897 
1898 
1899 
1900 
1901 
1902 
1903 
1904 
1905 
1906 
1907 
1908 
1909 
1910 
1911 
1912 
1913 
1914 
1915 
1916 
1917 
1918 
1919 
1920 
1921 
1922 
1923 
1924 
1925 
1926 
1927 
1928 
1929 
1930 
1931 
1932 
1933 
1934 
1935 
1936 
1937 
1938 



AGGGCTGCCC 
CTCTGACTGG 
CATCCAGCTG 
CAGCCGGGGC 
TGCCCCGACA 
GCCCCACCCG 
GCCTCAGACT 
CCAGAGTTGA 
GGAAGCACCA 
TGGTGACCCC 
TCCCCCCCCG 
CAAAGTCCTT 
TGCTGAGCCC 
CAGGGAGACC 
CCAGGAGGAG 
GGAGCCAGTC 
GAGACGGGTG 
TGAATCTAAC 
TACAAAGCCA 
CCGTAGCAGT 
GAGTGGAGGC 
AGGTGGGGGG 
GGGAGGGGAA 
AGACCACCCT 
GTGGACAACG 



TTCCGCCGGG 
AGGCCCTGCA 
CACTGGGTTT 
TGCCCACTGG 
ATGACCCCAT 
AGATCGGGGG 
GAGTGGTGAG 
CAGTGAGGGC 
TTTCAGGGGT 
GGGGGAGCCC 
TTGGAACTCA 
GTGCTGGACA 
GAGCAAAGCC 
AGCTGCGTGG 
GAGGGGTGGG 
TGCTGTGGGC 
GGCTGCAGAA 
AGCCTTTGTT 
TCTTTCAACT 
CCCACTGGGC 
TCCTGGCTGT 
CAGGCTTTCC 
CTAGGCCAAG 
GGGGAGCAGG 
AGGCCCTGGA 



CAGCCTGGGC 
CTGACTGACG 
GGGTGCTGGT 
CCTCGGTCAG 
CCTCAGGACG 
AAGCCCTATT 
TGTTCCCAAG 
TTCCTGGGCC 
GGGGGATGCC 
CGCTGGTCGT 
CTTTTCTCCC 
CCGACTACAA 
TGGCCTGCCA 
TCCTTGCTGC 
GGGTCCCTGA 
CTGTGGGTGG 
CTGTGACTGG 
ACCGGGGAGT 
ATCACATCCT 
ATTTTCAGGG 
GTCAGCCGGC 
CCCTGTGACC 
GGGGAAGGGC 
GACTGACCCC 
GAAATTCGAC 



CACATCTGCC 
CCAGGGTGCC 
CCTGCCCCCA 
GGTGAGCCCC 
CACCCCCCTT 
TCTTGACAAC 
TCCAGGAGGT 
CCATGCGCCT 
AGAGGCGCTC 
GGAGGGTGCT 
GTCTTGACCG 
AAAGTACCTG 
GTGCCTGGGT 
AACAGGGGGT 
GTCCCGCCAG 
CTGGGGACGG 
TGTGACCGTC 
TTCAATTATT 
GAAAACAAAT 
CCCCTGTGCC 
CCAGGGGGAG 
TGCAGACCCA 
AGGTGCTCTG 
CGTCCCTGCC 
AAAGCCCTCA 



CTTGGCCCCC 
CAGCCCAGGG 
AGCTGCCCGG 
AGCTGCCCCC 
CCCTTGCTGG 
TCCAGTCCCT 
GGTGGAGGGT 
GGCAGTGGCA 
CCCACCCCGT 
GGGGGCTGAC 
CGTCCAGCCT 
CTCTTCTGCA 
GGGTGCCAAC 
GGGGGGTGGG 
GAGAGAGTGG 
GGGCCAGACA 
GCGATGGGGC 
TCCCAAAATA 
GGCAGGTGAC 
AGGGGGGCGC 
GAAGGGACCC 
CTGCACTGCC 
GAGGGCAAGG 
CCATAGTCAG 
AGGCCCTGCC 



TCAGGACTCA 
TCTCTGGCGC 
ACACCACAGG 
GCTCAGGGCT 
GCAGTGTCCA 
GGGGGAGGGG 
CCTGGCGGAT 
GCAGGGAAGA 
CTTCGCCGGG 
TAGCAACCCC 
TGAATGAGAA 
TGGAAAACAG 
CCTGGCTGCC 
AGCTTGATCC 
TCGCATACCG 
CACAGGCCGG 
CGGTGGTCAC 
AGAACTCAGG 
ATTTTCTGTG 
GGGCATCGGC 
GGACAGCCAG 
CTGGGAGGAA 
GCAGACCTGC 
GACCCCGGAG 
CATGCACATC 



6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
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193 9 CGGCTTGCCT TCAACCCGAC CCAGCTGGAG GGTGAGCACC CAGGCCCCGC CCTTCCCCAG 8100 
1940 

1941 GGCAGGAGCC ACCCGGCCCC GGGACGACCT CCTCCCATGG TGACCCCCAG CTCCCCAGGC 8160 
1942 

1943 CTCCCAGGAG GAAGGGGTGG GGTGCAGCAC CCCGTGGGGG CCCCCTCCCC ACCCCCTGCC 8220 
1944 

1945 AGGCCTCTCT TCCCGAGGTG TCCAGTCCCA TCCTGACCCC CCCATGACTC TCCCTCCCCC 8280 
1946 

1947 ACAGGGCAGT GCCACGTCTA GGTGAGCCCC TGCCGGTGCC TCTGGGGTAA GCTGCCTGCC 8340 
1948 

1949 CTGCCCCACG TCCTGGGCAC ACACATGGGG TAGGGGGTCT TGGTGGGGCC TGGGACCCCA 8400 
1950 

1951 CATCAGGCCC TGGGGTCCCC CCTGTGAGAA TGGCTGGAAG CTGGGGTCCC TCCTGGCGAC 8460 
1952 

1953 TGCAGAGCTG GCTGGCCGCG TGCCACTCTT GTGGGTGACC TGTGTCCTGG CCTCACACAC 8520 
1954 

1955 TGACCTCCTC CAGCTCCTTC CAGCAGAGCT AAGGCTAAGT GAGCCAGAAT GGTACCTAAG 8580 
1956 

1957 GGGAGGCTAG CGGTCCTTCT CCCGAGGAGG GGCTGTCCTG GAACCACCAG CCATGGAGAG 8640 
1958 

1959 GCTGGCAAGG GTCTGGCAGG TGCCCCAGGA ATCACAGGGG GGCCCCATGT CCATTTCAGG 8700 
1960 

1961 GCCCGGGAGC CTTGGACTCC TCTGGGGACA GACGACGTCA CCACCGCCCC CCCCCCATCA 8760 
1962 

1963 GGGGGACTAG AAGGGACCAG GACTGCAGTC ACCCTTCCTG GGACCCAGGC CCCTCCAGGC 8820 
1964 

1965 CCCTCCTGGG GCTCCTGCTC TGGGCAGCTT CTCCTTCACC AATAAAGGCA TAAACCTGTG 8880 
1966 

1967 CTCTCCCTTC TGAGTCTTTG CTGGACGACG GGCAGGGGGT GGAGAAGTGG TGGGGAGGGA 8940 
1968 

196 9 GTCTGGCTCA GAGGATGACA GCGGGGCTGG GATCCAGGGC GTCTGCATCA CAGTCTTGTG 9000 
1970 

1971 ACAACTGGGG GCCCACACAC ATCACTGCGG CTCTTTGAAA CTTTCAGGAA CCAGGGAGGG 9060 
1972 

1973 ACTCGGCAGA GACATCTGCC AGTTCACTTG GAGTGTTCAG TCAACACCCA AACTCGACAA 9120 
1974 

1975 AGGACAGAAA GTGGAAAATG GCTGTCTCTT AGTCTAATAA ATATTGATAT GAAACTCAAG 9180 
1976 

1977 TTGCTCATGG ATCAATATGC CTTTATGATC CAGCCAGCCA CTACTGTCGT ATCAACTCAT 9240 
1978 

1979 GTACCCAAAC GCACTGATCT GTCTGGCTAA TGATGAGAGA TTCCCAGTAG AGAGCTGGCA 9300 
1980 

1981 AGAGGTCACA GTGAGAACTG TCTGCACACA CAGCAGAGTC CACCAGTCAT CCTAAGGAGA 9360 
1982 

1983 TCAGTCCTGG TGTTCATTGG AGGACTGATG TTGAAGCTGA AACTCCAATG CTTTGGCCAC 9420 
1984 

1985 CTGATGTGAA GAGCTGACTC ATTTGAAAAG ACCCTGATGC TGGGAAAGAT TGAGGGCAGG 9480 
1986 

1987 AGGAGAAGGG GACGACAGAG GATGAGATGG TTGGATGGCA TCACCAACAC AATGGACATG 9540 
1988 

1989 GGTTTGGGTG GACTCCAGGA GTTGGTGATG GACAGGGAGG CCTGGCGTGC TACGGAAGCG 9600 



PAGE: 40 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/206,176 



DATE: 03/23/94 
TIME: 12:17:27 



INPUT SET: S176S.raw 



1990 
1991 
1992 
1993 
1994 
1995 
1996 
1997 
1998 
1999 
2000 
2001 
2002 
2003 
2004 
2005 
2006 
2007 
2008 
2009 
2010 
2011 
2012 
2013 
2014 
2015 
2016 
2017 
2018 
2019 
2020 
2021 
2022 
2023 
2024 
2025 
2026 
2027 
2028 
2029 
2030 
2031 
2032 
2033 
2034 
2035 
2036 
2037 
2038 
2039 
2040 



GTTTATGGGG 
TACAGCAAAG 
TATTTTTATG 
ACCGTAGCCT 
GGTAGCCATT 
TATTGGCAGG 
CCCACTTAAT 
CTGCAGAGGG 
GGGCCACGTG 
TCTTTAAAAA 
TGCGATGGAC 
TGAGTCACCA 
CCTGATCAAA 
AGGTTCACCA 
CTCTTTAATC 



TCACAAAGAC 
TGGGGATTTT 
CATACCTGAA 
TCCAGGTTTC 
TCCTCCTCCA 
TGGATTCTTT 
TACCAAAGCT 
TGGTGGGGGT 
ACCCACAGTC 
AAACTGAGGT 
AGGACCTCCT 
GACACTCGGG 
GAGCAAGACC 
GAGCTGAGCT 
TGGATTTAAG 



TGAGTGACTG 
TTAGATAATA 
TGCTCAGTCA 
TTCTGTCCAC 
GGGGATCCTC 
ACCACTGTGC 
GCTCCAAGAA 
AGACTGTGAC 
CTGCAGACAG 
CTATTTTGTG 
CCCCAGGCCT 
GGTGGCCCCG 
AATGACTTCT 
GTCCTTTTGA 
GCCTACTTGC 



AACTGAGCTG 
AGAATATACA 
CTCAGTCGTA 
AGAATTCTCC 
CCGACCCAGG 
CACCAGGGAA 
AAAGCCCCTG 
CTGGGAACAC 
CCGGGTAGCT 
ACTTCGCTGC 
CAGGGGCTTC 
CCTTCAGGGT 
TAGGAGCAAG 
ACCTAAAGAC 
CCCTCAAGAG 



AACTGAATGG 
CATAACATAG 
TCTGACTCTG 
AAGGCAAGAA 
GATTGAACCG 
GCCCGTGTTA 
TGCCCTCTGA 
CCTCCCGCTT 
CTGCTCTTCA 
CGTAACTTCT 
AGGGAGCCAG 
GCTCACAGTC 
CAGACACCCA 
ACACAGCTCT 
GGAAGACAGT 



AAATGAGGTA 
TGTATACTCA 
TGACCTATGG 
TACTGGAGTG 
GCATCTCCTG 
CTCTCTATGT 
GCTTCCCGGC 
CAGGACTCCC 
AGGCTCATTA 
GAACATCCAG 
CCTTCACCTA 
TTCCCATCGT 
CAGGACACTG 
CGAAGGTTTT 
CCTGCATGTC 



CCCAGGACAG CCACTCGGTG GCATCCGAGG CCACTTAGTA TTATCTGACC GCACCCTGGA 
ATTAATCGGT CCAAACTGGA CAAAAACCTT GGTGGGAAGT TTCATCCCAG AGGCCTCAAC 
CATCCTGCTT TGACCACCCT GCATCTTTTT TTCTTTTATG TGTATGCATG TATATATATA 
TATATATTTT TTTTTTTTTC ATTTTTTGGC TGTGCTGGCT GTTCGTTGCA GTTCGGTGCG 
CAGGCTTCTC TCTAGTTTCT CTCTAGTCTT CTCTTATCAC AGAGCAGTCT CTAGACGATC 
GACGCGT 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 

10560 
10620 
10680 
10740 
10800 
10807 
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2041 
2042 
2043 

2044 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

2045 

2046 AATTCCGATC GACGCGTCGA CGATATACTC TAGACGATCG ACGCGTA 47 
2047 

2048 (2) INFORMATION FOR SEQ ID NO : 9 : 
2049 

2050 (i) SEQUENCE CHARACTERISTICS: 

2051 (A) LENGTH: 24 base pairs 

2052 (B) TYPE: nucleic acid 

2053 (C) STRANDEDNESS : single 

2054 (D) TOPOLOGY: linear 
2055 

2056 

2057 (vii) IMMEDIATE SOURCE: 

2058 (B) CLONE: BLGAMP3 
2059 

2060 

2061 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

2062 

2063 TGGATCCCCT GCCGGTGCCT CTGG 24 

2064 

2065 

2066 (2) INFORMATION FOR SEQ ID NO: 10: 
2067 

2068 (i) SEQUENCE CHARACTERISTICS: 

2069 (A) LENGTH: 24 base pairs 

2070 (B) TYPE: nucleic acid 

2071 (C) STRANDEDNESS: single 

2072 (D) TOPOLOGY: linear 
2073 

2074 

2075 (vii) IMMEDIATE SOURCE: 

2076 (B) CLONE: BLGAMP4 
2077 

2078 

2079 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

2080 

2081 AACGCGTCAT CCTCTGTGAG CCAG 24 
2082 

2083 (2) INFORMATION FOR SEQ ID NO: 11: 
2084 

2085 (i) SEQUENCE CHARACTERISTICS: 

2086 (A) LENGTH: 10 base pairs 

2087 (B) TYPE: nucleic acid 

2088 (C) STRANDEDNESS: single 

2089 (D) TOPOLOGY: linear 
2090 

2091 
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2092 (vii) IMMEDIATE SOURCE: 

2093 (B) CLONE: ZC6839 
2094 

2095 

2096 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

2097 

2098 ACTACGTAGT 10 
2099 

2100 (2) INFORMATION FOR SEQ ID NO:12: 
2101 

2102 (i) SEQUENCE CHARACTERISTICS: 

2103 (A) LENGTH: 42 base pairs 

2104 (B) TYPE: nucleic acid 

2105 (C) STRANDEDNESS : single 

2106 (D) TOPOLOGY: linear 
2107 

2108 

2109 (vii) IMMEDIATE SOURCE: 

2110 (B) CLONE: ZC6632 
2111 

2112 

2113 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

2114 

2115 CGACGCGGAT CCTACGTACC TGCAGCCATG TTTTCCATGA GG 42 
2116 

2117 (2) INFORMATION FOR SEQ ID NO:13: 
2118 

2119 (i) SEQUENCE CHARACTERISTICS: 

2120 (A) LENGTH: 21 base pairs 

2121 (B) TYPE: nucleic acid 

2122 (C) STRANDEDNESS : single 

2123 (D) TOPOLOGY: linear 
2124 

2125 

2126 (vii) IMMEDIATE SOURCE: 

2127 (B) CLONE: ZC6627 
2128 

2129 

2130 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

2131 

2132 AGGGCTTCGG CAAGCTTCAG G 21 
2133 

2134 (2) INFORMATION FOR SEQ ID NO:14: 
2135 

2136 (i) SEQUENCE CHARACTERISTICS: 

2137 (A) LENGTH: 24 base pairs 

2138 (B) TYPE: nucleic acid 

2139 (C) STRANDEDNESS: single 

2140 (D) TOPOLOGY: linear 
2141 

2142 
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2143 (vii) IMMEDIATE SOURCE: 

2144 (B) CLONE: ZC6521 
2145 

2146 

2147 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

2148 

2149 GCCAAAGACT TACTTCCCTC TAGA 24 

2150 

2151 

2152 (2) INFORMATION FOR SEQ ID NO: 15: 
2153 

2154 (i) SEQUENCE CHARACTERISTICS: 

2155 (A) LENGTH: 3 0 base pairs 

2156 (B) TYPE: nucleic acid 

2157 (C) STRANDEDNESS : single 

2158 (D) TOPOLOGY: linear 
2159 

2160 

2161 (vii) IMMEDIATE SOURCE: 

2162 (B) CLONE: ZC6520 
2163 

2164 

2165 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

2166 

2167 GCATGAACGT CGCGTGGTGG TTGTGCTACC 30 
2168 

2169 (2) INFORMATION FOR SEQ ID NO: 16: 
2170 

2171 (i) SEQUENCE CHARACTERISTICS: 

2172 (A) LENGTH: 3 0 base pairs 

2173 (B) TYPE: nucleic acid 

2174 (C) STRANDEDNESS: single 

2175 (D) TOPOLOGY: linear 
2176 

2177 

2178 (vii) IMMEDIATE SOURCE: 

2179 (B) CLONE : ZC6519 
2180 

2181 

2182 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

2183 

2184 ACCACGCGAC GTTCATGCTC TAAAACCGTT 30 
2185 

2186 (2) INFORMATION FOR SEQ ID NO: 17: 
2187 

2188 (i) SEQUENCE CHARACTERISTICS: 

2189 (A) LENGTH: 36 base pairs 

2190 (B) TYPE: nucleic acid 

2191 (C) STRANDEDNESS: single 

2192 (D) TOPOLOGY: linear 
2193 
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2194 

2195 (vii) IMMEDIATE SOURCE: 

2196 (B) CLONE: ZC6518 
2197 

2198 

2199 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

2200 

2201 GCTGCGGGAT CCTACGTACT AGGGGGACAG GGAAGG 36 
2202 

22 03 (2) INFORMATION FOR SEQ ID NO: 18: 
2204 

2205 (i) SEQUENCE CHARACTERISTICS: 

2206 (A) LENGTH: 45 base pairs 

2207 (B) TYPE: nucleic acid 
22 08 (C) STRANDEDNESS: single 
22 09 (D) TOPOLOGY: linear 
2210 

2211 

2212 (vii) IMMEDIATE SOURCE: 

2213 (B) CLONE: ZC6629 
2214 

2215 

2216 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

2217 

2218 CGACGCGAAT TCTACGTACC TGCAGCCATG AAAAGGATGG TTTCT 45 
2219 

2220 (2) INFORMATION FOR SEQ ID NO: 19: 
2221 

2222 (i) SEQUENCE CHARACTERISTICS: 

2223 (A) LENGTH: 45 base pairs 

2224 (B) TYPE: nucleic acid 

2225 (C) STRANDEDNESS: single 

2226 (D) TOPOLOGY: linear 
2227 

2228 

2229 (vii) IMMEDIATE SOURCE: 

2230 (B) CLONE: ZC663 0 
2231 

2232 

2233 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

2234 

2235 CGACGCGAAT TCTACGTACC TGCAGCCATG AAACATCTAT TATTG 45 

2236 

2237 

2238 (2) INFORMATION FOR SEQ ID NO: 20: 
2239 

2240 (i) SEQUENCE CHARACTERISTICS: 

2241 (A) LENGTH: 21 base pairs 

2242 (B) TYPE: nucleic acid 

2243 (C) STRANDEDNESS: single 

2244 (D) TOPOLOGY: linear 
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2245 
2246 

2247 (vii) IMMEDIATE SOURCE: 

2248 (B) CLONE: ZC6625 
2249 

2250 

2251 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

2252 

2253 GTGAGATTTT CAGATCTTGT C 21 
2254 

2255 (2) INFORMATION FOR SEQ ID NO: 21: 
2256 

2257 (i) SEQUENCE CHARACTERISTICS: 

2258 (A) LENGTH: 21 base pairs 

2259 (B) TYPE: nucleic acid 

2260 (C) STRANDEDNESS: single 

2261 (D) TOPOLOGY: linear 
2262 

2263 

2264 (vii) IMMEDIATE SOURCE: 

2265 (B) CLONE: ZC6626 
2266 

2267 

2268 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

2269 

2270 AAGAATTACT GTGGCCTACC A 21 
2271 

2272 (2) INFORMATION FOR SEQ ID NO:22: 
2273 

22 74 (i) SEQUENCE CHARACTERISTICS: 

2275 (A) LENGTH: 33 base pairs 

22 76 (B) TYPE: nucleic acid 

2277 (C) STRANDEDNESS: single 

2278 (D) TOPOLOGY: linear 
2279 

2280 
2281 

2282 (vii) IMMEDIATE SOURCE: 

2283 (B) CLONE: ZC6624 
2284 

2285 

2286 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

2287 

2288 GCTGCGGAAT TCTACGTACT ATTGCTGTGG GAA 33 
2289 

2290 (2) INFORMATION FOR SEQ ID NO:23: 
2291 

2292 (i) SEQUENCE CHARACTERISTICS: 

2293 (A) LENGTH: 45 base pairs 

2294 (B) TYPE: nucleic acid 

2295 (C) STRANDEDNESS: single 



♦ 
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2296 (D) TOPOLOGY: linear 

2297 

2298 

2299 (vii) IMMEDIATE SOURCE: 

2300 (B) CLONE: ZC6514 
2301 

2302 

2303 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

2304 

23 05 CGACGCGGAT CCTACGTACC TGCAGCCATG AGTTGGTCCT TGCAC 45 
2306 

2307 (2) INFORMATION FOR SEQ ID NO:24: 
2308 

2309 (i) SEQUENCE CHARACTERISTICS: 

2310 (A) LENGTH: 21 base pairs 

2311 (B) TYPE: nucleic acid 

2312 (C) STRANDEDNESS : single 

2313 (D) TOPOLOGY: linear 
2314 

2315 

2316 (vii) IMMEDIATE SOURCE: 

2317 (B) CLONE: ZC6517 
2318 

2319 

2320 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

2321 

2322 GTCTCTGGTA GCAACATACT A 21 

2323 

2324 

2325 (2) INFORMATION FOR SEQ ID NO: 25: 
2326 

2327 (i) SEQUENCE CHARACTERISTICS: 

2328 (A) LENGTH: 22 base pairs 

2329 (B) TYPE: nucleic acid 

2330 (C) STRANDEDNESS: single 

2331 (D) TOPOLOGY: linear 
2332 

2333 

2334 (vii) IMMEDIATE SOURCE: 

2335 (B) CLONE: ZC6516 
2336 

2337 

2338 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

2339 

2340 GGGTTTCTAG CCCTACTAGT AG 22 
2341 

2342 (2) INFORMATION FOR SEQ ID NO: 26: 
2343 

2344 (i) SEQUENCE CHARACTERISTICS: 

2345 (A) LENGTH: 22 base pairs 

2346 (B) TYPE: nucleic acid 



9 
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2347 (C) STRANDEDNESS: single 

2348 (D) TOPOLOGY: linear 
2349 

2350 

2351 (vii) IMMEDIATE SOURCE: 

2352 (B) CLONE: zc6515 
2353 

2354 

2355 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

2356 

2357 GGGTTTCTAG CCCTACTAGT AG 22 

2358 

2359 

2360 (2) INFORMATION FOR SEQ ID NO: 27: 
2361 

2362 (i) SEQUENCE CHARACTERISTICS: 

--> 2363 (A) LENGTH: 47 base pairs 

2364 (B) TYPE: nucleic acid 

2365 (C) STRANDEDNESS: single 

2366 (D) TOPOLOGY: linear 
2367 

2368 
2369 

2370 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

2371 

2372 AAGCTACGCG TCGATCGTCT AGAGTATATC GTCGACGCGT CGATCGG 



2373 
2374 



4 » ft 
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Line 
2363 



Error 

Entered (47) and Calc. Seq. Length (0) differ 



Original Text 

(A) LENGTH: 47 base pairs 



* 
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APPLICATION NUMBER 
FILING DATE 

PRIOR APPLICATION DATA 
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Line Original Text Corrected Text 



